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ABSTRACT 


This dissertation contains work on three main topics. 

Chapters 1 through 4 provide complexity results for the single cut-or-join model 
for genome rearrangement. Genomes will be represented by binary strings. Let S 
be a finite collection of binary strings, each of the same length. Define M to be 
the collection of medians — binary strings w which minimize {,<9 H(u,v) where H 
is the Hamming distance. For any non-negative function f(x), define Z(f(x),S) to 
be Syem ves f(A (u,v). We study the complexity of calculating Z(f(x),S), with 
respect to the number of strings in S and their length. 

If the leaves of a star are labeled with the strings in S, then Z(z!,S) counts the 
pairs of functions where one selects a median jz for S and the other assigns, to each 
vy € S,a permutation of coordinates in which p and v differ. This relates to the small 
parsimony problem for genome rearrangement. We show that it is #P-complete to 
calculate Z(x!,S) and give similar results for other functions f(x). We also consider 
an analogous problem when the leaves of a binary tree are labeled. This is joint work 
with Istvan Miklos. 

Chapters 5 and 6 explore tree invariants. In particular, Chapter 5 examines the 
eccentricity of a vertex, eccr(v) = max,¢pdr(v,u) where dp(u,v) is the number of 
edges along the path connecting u and v in T. This was one of the first, distance- 
based, tree invariants studied (Jordan 1869). The total eccentricity of a tree, Ecc(T), 
is the sum of the eccentricities of its vertices. We determine extremal values and 
characterize extremal tree structures for the ratios Ecc(T’)/ ecer(u), Ecce(T’)/ ecer(v), 


ecer(u)/ecer(v), and eccr(u)/ecer(w) where u,w are leaves of T and v is in the 
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center of 7. Analogous problems have been resolved for other tree invariants including 
distance (Barefoot, Entringer, and Székely 1997) and the number of subtrees (Székely 
and Wang 2013). In addition, we determine the tree structures that minimize and 
maximize total eccentricity among trees with a given degree sequence. This is joint 
work with Laszlé Székely and Hua Wang. 

Chapter 6 compares three different middle parts of a tree. Different middle parts 
such as center, centroid, subtree core have been defined and studied. We want to 
provide some general insights on the difference between them and consider how far 
apart (with given order of the tree) two different ‘middle point’ can be and when 
such maximum distances are achieved. This study, after conducted on general trees, 
is naturally extended to trees with restricted degrees or diameter due to the evident 
correlation between these restrictions and the maximum distance between middle 
parts. Some related interesting questions arise that may be of interest independently. 
This is joint work with Laszl6 Székely, Hua Wang, and Shuai Yuan. 

Chapter 7 studies a problem related to Baranyai’s Theorem. This guarantees that 
whenever k divides n, there is a partition of (Ge) into rows such that each row is itself 
a partition of [n]. Baranyai (1973) used graph flows to give an existence proof for this 
118 year old conjecture. We are interested in the structure of these partitions. For 
k; = 2, there is a circular configuration which yields a straightforward construction. 
Beth (1974) found an algebraic construction for k = 3. However neither method has 
a known extension to larger k. We consider a new construction for k = 2 which makes 
use of a bijection between partitions and labeled trees. It is our hope that this type 
of connection will lead to a more general construction of Baranyai partitions. This is 


joint work with Laszlé Székely. 
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CHAPTER 1 


GENOME REARRANGEMENT 


Soon after Sturtvant (1913) developed the first genetic map, he published his obser- 
vations (Sturtvant 1921) regarding genome rearrangement in the fruit fly Drosophila 
melanogaster. Later Dobzhansky and Sturtvant (1938) took a deeper look at genome 
rearrangement, analyzing the rearrangement scenarios for two species, Drosophila 
pseudoobscura and Drosophila miranda. Palmer and Herbon (1988) focused their 
studies on genome rearrangement in plants and began the discussion about most par- 
simonious scenarios. We build upon this foundation, exploring the Single Cut-or-Join 
model for genome rearrangement. 

A genome is represented by an edge-labeled digraph where each vertex has total 
degree (sum of in degree and out degree) at most two. Each directed edge represents 
a gene (or synteny block). Each gene has a head and a tail, collectively called extrem- 
ities. Vertices of degree two are called adjacencies while vertices of degree one are 
called telomeres. Because of the degree restriction, each extremity can participate in 
at most one adjacency. 

There are many ways to represent genomes, each giving a different viewpoint of 
the structure. Here we give a representation of genomes through their adjacency 
graphs. Fix a set of genes {v1, v2,...,;Um}. Let hy, denote the head of gene v; and ty, 
denote the tail. For a genome G on this set of genes, create an adjacency graph A(G) 


where the vertices are precisely the extremities of the genes in (G): 
V(A(Q)) = ies ty.) los, bys, ara) Petes 


Connect two extremities with an edge if they form an adjacency in G. 


While each edge in A(G) represents an adjacency in G, each vertex of A(G) with 
degree zero indicates a telomere. Consequently, we may use unordered pairs of gene 
extremities to describe an adjacency and a single gene extremity to identify a telomere. 
Because each extremity participates in at most one adjacency, the degree of each 
vertex is at most one and A(G) must be a matching (possibly empty or perfect). 
Observe that the adjacency graph A(G) is uniquely defined by G. 


Given a set of m genes, create a graph A with 
V(A) = Viatsbes hos; ty, a) Paine. tie} 


and an edge set which is a matching. This graph will uniquely describe a genome on 
the given genes. It is then evident that genomes can be uniquely described by their 
set of genes and their adjacencies, for any extremity which does not appear in an 
adjacency must be a telomere. 

Next we explore the interaction of multiple genomes on the same set of genes. Fix 
a set of m genes {v1, V2,...,Um} and a multiset of n genomes G = {G1, Go,...,Gn} on 
the same set of specified genes. Draw a single adjacency graph A(G), where V(A(G)) 
is the set of gene extremities and an edge is drawn between two extremities if there 
is at least one genome in G which has the corresponding adjacency. We say that G 
has independent adjacencies if E(A) is a matching for A. 

The final characterization makes use of binary strings. Fix a set of m genes and 
a genome G on these genes. Define a binary string with ee coordinates. The 
coordinates are in one-to-one correspondence with the possible adjacencies (pairs of 
gene extremities). The genome G is then represented by the binary string which has 
a 1 in each coordinate that corresponds to an adjacency in G and a 0 in every other 
coordinate. 

The binary string representation seems to be the most concise representation, but 
it is more difficult to verify the vertex degree condition for G. However, given a set 


of m genes and a multiset of genomes G on those genes which have independent 


th to ts t4 ts 
Genome G Adjacency graph A(G) 


Binary String Representation of G 


hi ti ho te hg tg ha ta hs ts 
hi 0 0 0 1 0 0 0 0 
ti 0 1 0 0 0 0 0 0 
he 0 0 1 0 0 0 0 
te 0 0 0 O 0 0 
hs 0 0 O 0 0 
t3 0 O 0 0 
ha 0 1 O 
t4 0 0 
hs 0 
ts 


Figure 1.1 Three representations of a genome. The binary string 
representation can be obtaining by reading the rows of the array. 


adjacencies, the binary string representation is preferred. In this setting, the number 
of different adjacencies that appear in at least one genome in G is m, so we can restrict 
our binary strings at most m coordinates. The usefulness of this representation will 
become more evidence after we discuss the Single Cut-or-Join model for genome 
rearrangement. 

From here forward, every multiset of genomes will consist of genomes on the same 
set of genes. For a multiset of genomes observed in current species, we use a tree 
to represent their phylogenetic history, labeling the leaves of the tree with the given 
genomes. The most ancient vertex in the tree can be considered the root, a common 
ancestor of the genomes in the leaves. Each internal vertex represents an unknown 
species. Given a fixed tree and a labeling of its leaves with genomes, we seek the most 
likely genomes with which to label these internal nodes, according to some criterion. 
In particular, we use the parsimony criterion to determine likelihood. Before this, we 


first define a model for genome rearrangement. 


Along each edge of the tree, subsequent mutations occurred to transform the 
genome of the ancestor into the genome of its descendant. To describe these muta- 


tions, we use the Single Cut-or-Join model for genome rearrangement. 


Definition 1.1 (Feijao and Meidanis (2011)). A Single Cut-or-Join (SCJ) operation 
transforms one genome into another genome by altering the set of adjacencies in 


exactly one of the following ways: 
e Cut: replace adjacency u = {x,y} with telomeres u, = {x} and ug = {y}; 


e Join: replace telomeres uy = {x} and ug = {y} with the single adjacency 


u= {x,y}. 


The parsimony principle asserts that the true phylogenetic history minimizes the 
number of mutations that must take place along the edges of the tree. Feijao and 
Meidanis (2011) showed that the minimum number of SCJ operations needed to 
transform one genome into another is precisely the Hamming distance between their 
binary string representations. This is achieved by first cutting all of the adjacencies 
in the ancestor’s genome which do not occur in the descendant’s genome. ‘Then 
make the necessary joins to obtain the genome of the descendant. As we consider 
the possible ancestors which could label the internal nodes of the tree, we use the 
parsimony score to determine the likelihood of a labeling. For a tree T and labeling 
y of the vertices of T’ with binary strings, the parsimony score is precisely the sum 
of the Hamming distances between labelings on adjacent vertices; symbolically, 

d— = A(y(u), y(v)) 
wEE(T) 
where H(.,.) is the Hamming distance between the two inputs. The labelings with 
minimum parsimony score are called “most parsimonious labelings” and are consid- 


ered to be the most likely. 


\: 


2 3 2 3 
G G' 


Figure 1.2. Two genomes with 6 possible SCJ scenarios 
between them. 


Given two genomes on the same set of genes, in the process of one changing 
into the other, we assume that only one SCJ operation happens at a time because 
mutations are generally rare events. A time-order sequence of these SCJ operations, 
transforming one genome into another with the fewest number of SCJ operations is an 
SCJ scenario. We may note that the number of SCJ scenarios between two genomes 
is at most the factorial of the Hamming distance between their binary strings. In 
practice, there may be fewer SCJ scenarios because the digraph produced from each 


subsequent cut or join must be a valid genome. 


Example 1.2. For the two genomes in Figure 1.2, the number of SCJ operations 
needed to transform G into G’ is 4 because G has two adjacencies which are not in G' 
({ho, ts}, {t1, ha}) and G’ also has two adjacencies not in G ({ho, ts}, {ts, ha}). While 
there are two cuts and two joins that must be made, notice that the first SCJ operation 
must be a cut. 

An SCJ scenario for G and G’ must be either follow a cut-cut-join-join pattern (4 
different SCJ scenarios) or an alternating cut-join-cut-join pattern (2 different SCJ 
scenarios). If, for example, the first SCJ operation was to cut the adjacency {hz2,t3}, 
then the second SCJ operation could be to join {hz} and {ts}, but it could not be to 
join {ha} and {hs} because hg is not a telomere yet. As a result, there are there are 


precisely 6 possible SCJ scenarios that will transform G into G’. 


When we restrict our attention to a pair of genomes, {G,G’}, with independent 


adjacencies, then the vertices of an adjacency {x,y} that appears in G’ but not in G 


ee Wee Mi 
ALG, G'}) 


Figure 1.3 Two genomes with 4! = 24 different SCJ 
scenarios that will transform G into G’. The adjacency 
graph A({G,G’}) shows that {G,G’} has independent 
adjacencies. 


must appear in two telomeres, {x} and {y} in G. The same holds for an adjacency 
which appears in G but not in G’. Therefore, among the set of cuts and joins that 
must be performed to transform G into G’, performing one necessary SCJ operation 
will not affect the ability to make a different SCJ operation. Therefore, any ordering 
is possible and the number of possible SCJ scenarios is the factorial of the Hamming 


distance between the binary string representations of G and G’. 


Example 1.3. Figure 1.3 defines two genomes such that the adjacencies of {G,G'} 
are independent as indicated by the adjacency graph A({G,G'}). 

To transform G into G’, there are 3 cuts and 1 join that must be made. Because 
of independence, these operations can be performed in any order. As a result, there 


are 4! = 24 different SCJ scenarios. 


On the phylogenetic tree endowed with a most parsimonious labeling of the ver- 
tices, we will assign to each edge an SCJ scenario which details the transformation 
of the ancestral genome at one endpoint to the descendant genome labeling its other 
endpoint. We make the assumption that the SCJ scenario on one edge has no influ- 


ence over the SCJ scenario on a different edge. 


All of these definitions will be made precise in the next section. Here is a brief 
summary of the structures we will be counting. We are given a multiset of genomes, all 
on the same set of genes, and an ancestral tree T relating them. These will be viewed 
as a tree T with binary strings labeling the leaves. The first goal is to determine 
the possible most parsimonious labelings of the internal nodes of 7. For each most 
parsimonious labeling, we then label each edge of the tree with an SCJ scenario 
which details the time order that mutations occurred in transforming the genome at 
one endpoint into the genome at the other. The result is a most parsimonious SCJ 


scenario. 


1.1 MATHEMATICAL MODEL 


Now let us fix some notation and formalize a few of the definitions which were in- 
troduced above. Throughout the paper, we let [m] := {1,2,...,m}. For two binary 
strings 7 and 1/ of the same length, we let H(n, 7’) be the Hamming distance between 
7 and 7, which is the number of coordinates in which 7 and 7’ differ. Every multiset, 
S, of binary strings will have the property that no two strings in S differ in length. 
For an arbitrary binary string 7 of length n and coordinates labeled by the integers 1 
through n, we will use the notation 7[z] to denote the value of 7 in the z coordinate 
for z € [n|. For a multiset A, we define #[zx, A] to be the multiplicity of the element 
xin A. (If « ¢ A then #|z, A] = 0.) To explicitly write out a multiset, we use 


subscripts in parentheses to indicate the multiplicity of an element. For example, 
{a3), bc4), C1) } = {a, a, a, b, b, b, b, c}. 
For two multisets A and B, we define AW B to be the multiset with 
#(z, AW B] = #[z, A] + #[z, Bl). 


Definition 1.4 (Most Parsimonious Labeling). Let T be a tree with s leaves. Let 


B= {vj,v2,...,Us} be a multiset of binary strings which represent genomes with m 


genes. Let 6: L(T) > B be a surjection which assigns a binary string to each leaf of 
T. A most parsimonious labeling of the vertices of T, endowed with leaf-labeling ¢, 


is a labeling ¢' : V(T) — {0, 14) with 
e o'(l) = &(£) for each € € L(T) (i.e. ¢' extends ), 
e @'(v) corresponds to a valid genome for each v € V(T), and 


e So A(¢(u),¢'(v)) is minimum among the possible functions d'. 
uve E(T) 


Definition 1.5 (SCJ scenarios). Let 7 and 1 be two genomes on the same set of m 
genes. An SCJ scenario for the pair (n,7') is a minimum length sequence of cuts and 
joins to be performed consecutively in order to transform n into 7! with the condition 
that each subsequent cut or join creates a valid genome. 

In the binary string representation, an SCJ scenario is just a permutation of 
the coordinates in which the binary strings differ, telling the order in which the bits 
should be flipped. If the adjacencies are independent, then any permutation is an 
SCJ scenario. Otherwise, one must verify that the binary string resulting from each 
subsequent bit-flip creates a valid genome. 

Let T be a tree with most parsimonious labeling ¢'. For uw € E(T), an SCJ 


scenario for (¢'(u), ¢'(v)) will also be called an SCJ scenario for the edge uv. 


Definition 1.6. Let B be a multiset of genomes on the same set of genes. Draw the 
adjacency graph A(B) as described in Section 1. If the edges form a matching, then 


we say that the adjacencies of multiset B are independent. 


Note that if {7, 7'} has independent adjacencies, then the number of different SCJ 
scenarios for (7,7') is precisely the factorial of the Hamming distance between their 


binary string representations. 


Definition 1.7 (Most parsimonious SCJ scenario). Let T be a tree and ¢ be a function 


labeling the leaves of T with binary strings. A most parsimonious SCJ scenario for T 


and @ consists of a most parsimonious labeling ¢' and an SCJ scenario for each edge 


of T under vertex labeling ¢’. 


Definition 1.8. Given a tree T and leaf labeling ¢, let ¢' be a most parsimonious 
labeling. We say that the number of SCJ scenarios admitted by ¢’ is precisely the 


number of ways to assign an SCJ scenario to each edge of T. 


When the tree T is a star with leaf labeling ¢, each most parsimonious labeling ¢’ 
of the vertices is characterized by the binary string py it assigns to the center vertex 


of the star. We call yp a median. Formally, we define a median as follows: 


Definition 1.9 (Median). Let B = {v;}™, be a multiset of binary strings which 
represent a multiset of genomes on the same m genes. A binary string w which rep- 
resents a genome on these m genes and minimizes Diep) H(i, u) ts called a median 
for B. 

If the multiset of genomes has independent adjacencies or if a multiset of binary 
strings is given without reference to genomes, then any binary string which minimizes 


Diep) H(i, 2) ts a median. 


The next definition will fit into the context of our later proofs. Here we merely 


state the definition. 


Definition 1.10. For an arbitrary n,t € Z*, let B be an arbitrary multiset of binary 
strings of length 2n+t. We use M(B) to denote the set of all medians for B. If the 


strings of B are defined on the coordinates 


(1, Y1, La, Yo, s++5Uny Yn, C1, €2,--- se) 


we use M'(B) to denote the subset of M(B) containing only those medians 1 with 


“zi A uly; for alli € [n]. 


While it is possible for M’(B), in the previous definition, to be empty, when we use 
the definition later, most of our multisets B are defined so that M = {0,1}°" x {0}! 


and M = {01,10}” x {0}". 


1.2. COMPUTATIONAL COMPLEXITY 


While P and NP are complexity classes for decision problems, the following classes 
are for counting problems. 

The classes #P, #P-hard, and #P-complete were first defined by Valiant (1979). 
The definition for #P that we give here, while not the original, is an equivalent 


definition. 


Definition 1.11 (Welsh (1993)). The class #P contains those functions f : 3* +N, 


for some alphabet 3’, such that both of the following hold: 


e There is a polynomial p, a relation R, and a polynomial time algorithm which, 


for each input w € &* and eachy € X™* with |y| < p(|w|), determines if R(w, y). 
e For any input w, f(w) = {y= |yl S p(w) and R(w,y)}I- 
Definition 1.12 (Cook (1971)). A polynomial time reduction from one decision 


(counting) problem A to another decision (counting) problem B is an algorithm which, 


for an arbitrary instance of A, 
e runs in time polynomial in the inputs of the instance of A, 
e creates an instance of B, 


e the answer to each instance of A can be computed in polynomial time from the 


answer of the instance of B. 
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Definition 1.13 (Valiant (1979)). A counting problem is in #P-hard if there is a 
polynomial time reduction to it from every problem in #P. A counting problem is in 


#P-complete if it is in #P and is in #P-hard. 


Next we give a few known computational complexity results. 'To state these result, 
we establish some terminology. 

Closed normal form (CNF) is a standard format in which to express Boolean 
formulas. A 3CNF is a Boolean formula I" which is the conjunction of clauses and 
each clause is the disjunction of 3 literals. The symbol A is used for conjunction and 
the symbol V is for disjunction. A 3CNF, I’, with n variables {v1,v2,...,Un,} and k 
clauses takes the form [I = c; Aco A... A cy, where each c; is a clause which is the 
disjunction of three literals and the literals are from {v;}"_, U{7%j}*_,. Because I’ was 
said to have n variables, we may assume that, for each i € [n], v; or 0; appears in 
some clause of I’. Each v; is a positive literal while each 0; is a negative literal. The 
negative literal tj is the negation of v;. We identify 0; with the literal v;. We refer 
to {u;}2, as the variables of I and always assume that the set of variables has an 
ordering. 

A truth assignment for I’ is a function f : {v;}_, - {T, F} which assigns a value 
of true or false to each variable. If a truth assignment makes I" true, we say it satisfies 
I’. Otherwise, a truth assignment does not satisfy I’ in which case there is at least 


one clause which is not satisfied. 


Definition 1.14 (3SAT). Given an arbitrary I in 3CNF with n variables and k 


clauses, decide if there is a truth assignment for I’ which satisfies I’. 


Definition 1.15 (#3SAT). Given an arbitrary Boolean formula I in 83CNF with n 


variables and k clauses, count the number of truth assignments which satisfy I’. 
Theorem 1.16 (Cook (1971)). 3SAT € NP-complete. 


Theorem 1.17 (Cook (1971)). #3SAT € #P-complete. 
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Define D3CNF to be the subset of 3CNF containing only those 7 = Ajejj c; such 


that for each i € [A], 

e c; contains three distinct literals, and 

e c; does not contain both v; and Uj for any 7 € [nJ. 
This defines the following two problems. 


Definition 1.18 (D3SAT). For an arbitrary [ in D3CNF with n variables and k 


clauses, decide if there a truth assignment which satisfies I’. 


Definition 1.19 (#D3SAT). For an arbitrary I in D3CNF with n variables and k 


clauses, count the number of truth assignments which satisfy I. 
The following two results are proven through reductions from #3SAT and 3SAT. 
Lemma 1.20. #D8SAT © #P-complete. 


Proof. This is a reduction from #3SAT. Let I’ be a 3CNF with n variables and k 
clauses, n > 3. Let vq, ug, vy be literals in I with a 4 8 #7 # a. Observe that each 


of the following pairs have the same satisfying truth assignments. 


(Ua V ug V ug) and (vq V ug V Uy) A (Ua V Ug V OF). 


(Ua V Va V Va) and (Ve V ug V Uy) A (Va V BB V Vy) A (Va V Ug V By) A (Va V UB V Ty). 


Further, a clause of the form (vg V Ug V vg) is alway true, so it can be removed. 
Making these replacements in I’ will result in a D3CNF I” with n’ variables 
(n’ < n) and at most 4k clauses. Because some clauses like (vq V Ua V vg) are in I’ 
but not in I”, it is possible that n’ <n. 
Given a satisfying truth assignment for J’, we may extend it to a satisfying truth 
assignment for I in 2”-" ways. This is because the variables in I: which are not in 


I” do not affect the ability of a truth assignment to satisfy ’. On the other hand, 
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each satisfying truth assignment for I’, restricted to the variables of I’, will be a 


satisfying truth assignment for I’. a 
Lemma 1.21. D3SAT € NP-complete. 


Proof. As described in the last proof, for any 3CNF I’, there is a D3CNF I” which 
is computable in polynomial time such that I” has at least one satisfying truth as- 


signment exactly when J’ has at least one satisfying truth assignment. : 


Next, we return our attention to the phylogenetic histories which were introduced 
in Section 1 and formalized in Section 1.1. The complexity results of these next 3 


chapters address subquestions and analogues of the following problems. 


Definition 1.22 (#SPSCJ). Given a tree T and a labeling y of the leaves of T 
with binary strings, #SPSCJ asks for the exact number of most parsimonious SCJ 


scenarios. 
Lemma 1.23. #SPSCJ € #P. 


Proof. The input includes a tree T with n vertices, and a function y : L(T) > {0,1}". 
A witness is a function y’ : V(T) > {0,1} and a function which assigns an SCJ 
scenario (a permutation of a subset of |¢]) to each edge of the tree. The size of the 
input is at most O(én). 

Feijao and Meidanis (2011) gave a polynomial time algorithm to find one most 
parsimonious labeling. The parsimony score for this labeling can be calculated in time 
polynomial in the number of edges of J’ and in the length, @, of the binary strings 
labeling the leaves of 7. Then we need only compare this parsimony score with the 
parsimony score of the possible witness. If they are the same, then y’ is a most 
parsimonious labeling. For each edge, we can verify that a permutation assigned to 
an edge is an appropriate SCJ scenario for that edge in O(¢) time. By Definition 1.11, 
#S5PSCJ is in #P. : 


13 


The next two definitions are for special cases of #SPSCJ. Notice that they are 
stated without mention of genes and genomes. The problems are modeling the case 
when a multiset of genomes has independent adjacencies. Since each labeling of an- 
cestral genomes is most parsimonious, the genomes assigned to internal vertices will 
only contain subsets of the adjacencies appearing in the given multiset of genomes. 
Therefore, we restrict our binary string representations to these coordinates. Because 
the multiset of genomes has independent adjacencies, no two adjacencies being con- 
sidered share a common extremity. Therefore, a binary string, under this coordinate 
restriction, may have all ones. 

When the tree T is a star and the multiset of genomes labeling the leaves has 
independent adjacencies (Definition 1.6), we can state the following special case of 


#5PSCI: 


Definition 1.24 (#StarSPSCJ). Given an arbitrary m € Zt, let B= {v;}'%, be an 


arbitrary multiset of binary strings. Determine the value of 


S- II A(y;, p)!. 


peM(B) ie[m] 


When the tree T is a binary tree and the genomes labeling the leaves have inde- 


pendent adjacencies (Definition 1.6), we state one more special case of #SPSCJ: 


Definition 1.25 (#BinSPSCJ). Given arbitrary integer m > 2, let T be a binary 
tree with m leaves. Let B = {uy}, be an arbitrary multiset of binary strings and 
a surjective function y : L(T) + B. Define F to be the set of most parsimonious 
labelings yp! which extend y to V(T). Determine the value of 


~ TL £6), ¢))!- 


y'€F we E(T) 
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The results in the next three chapters examine #SPSCJ and some analogues for 
classes of trees such as binary trees and star trees. Most of the complexity results are 
reductions from #D3SAT. In other words, given a D3CNF I" with n variables and k 
clauses, we create a multiset of m binary strings of length 2n + t (where t and m are 
polynomials of n and k) to label the leaves of the tree. These strings will be chosen 
so that the number of most parsimonious SCJ scenarios is related to the number of 


satisfying truth assignments for I’. 


15 


CHAPTER 2 


RESULT FOR STAR PHYLOGENETIC TREES 


This chapter examines the computational complexity of #StarSPSPCJ. The first 
section details some tools and constructions that will be needed for the proof of our 
main result in Section 2.2. This main result, Theorem 2.18, states that #StarSPSCJ 


is ##P-complete. 


2.1 ENCODING A CLAUSE 


The proof of Theorem 2.18 will define a polynomial reduction from ##D3SAT (Defi- 
nition 1.19) to #StarSPSCJ (Definition 1.24). Fix an arbitrary D3CNF, I’, with n 
variables and & clauses. Fix a prime p < 5max{300,n + 5} which will be utilized 
later. We will define a multiset of binary strings D(p) to label the leaves of a star. 
This multiset will encode J’. Once the leaves of the star tree are labeled with our 
binary strings, we have an instance of #StarSPSCJ. For this instance, #StarSPSCJ 
asks us to count the number of most parsimonious SCJ scenarios. To do this, first fix 
a most parsimonious labeling and count the number of SCJ scenarios it admits (Def- 
initions 1.4 and 1.8). Then sum this quantity over all most parsimonious labelings. 

For the star, each most parsimonious labeling is identified by the median (Def- 
inition 1.9) which it assigns to the center of the star. As in Definition 1.10, we let 
M(D(p)) be the set of all medians for the star tree with leaf labels D(p). 

Our task is to define a multiset of binary strings D(p) to encode I’. The multiset 
D(p) will be chosen so that M(D(p)) will have a set of desired characteristics. First, 


each of our strings in D(p) and the medians M(D(p)) will have length 2n + t with 
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coordinates 


(fis Y1, ©2, Y2,-++5Un; Yn, €1, €2,--- ,€t) 


where n is the number of variables in I’ and the t is a polynomial of n and k which 
will be defined later. Second, M(D(p)) will be the set of all binary strings jz of 
length 2n +t that have jule;| = 0 for each 7 € [t]. In other words, D(p) will be defined 
so that M(D(p)) equals {0,1}2” x {0}*. Recall M’(D(p)), from Definition 1.10, 
is the subset of M(D(p)) with the additional property that pla;| A uly;| for all 
i € [n]. Once we have established that M(D(p)) = {0,1}°” x {0}', we can conclude 
M'(D(p)) = {01,10}" x {0}'. This allows for a connection with truth assignments 
for I’. 


Definition 2.1. Letn € Z*. For arbitrary I in DSCNF with n variables, let S be a 
multiset of binary strings on the coordinates (2X1, Y1,---;2n;Yn;€1,---,€t). There is an 
injective function f which assigns to each median 1 € M'(S) a truth assignment for 
I. In particular, f(y) will assign a value of true to the i” variable of I’ if u{x;] =1 
and false if u{x;| = 0. 


Remark 2.2. If multiset S is chosen so that M’(S) = {01,10}” x {0}', then Defi- 


nition 2.1 provides a bijection between M'(S) and the truth assignments for I’. 


Definition 2.3. Letn € Z*. Given an arbitrary DSCNF, I’, with n variables, let S' be 
an arbitrary multiset of binary strings on the coordinates (%1, Y1,---,2n; Yn; C1, -- +5 Ct) 
for some t € Z*. Define M‘,(S) to be a subset of M'(S), containing only those 
medians which, through the bijection in Definition 2.1, correspond to a satisfying 


truth assignment for I’. Since a single clause c in I’ is also a D8CNF, this defines 


M'(S) as well. 


Cc 


For #StarSPSCJ, we are asked to calculate 


ye I A(p,v%)!. 


eM (D(p)) i€[m] 


Le 


To do this, we first calculate [Tjcjmj H(u,%)! for each median wp € M(D(p)) where 
the product is the number of scenarios admitted by median y (Definition 1.8). The 
multiset D(p) will be constructed so that there is a constant K(p) (specified in 
Claim 2.22) which is with each string in M’',.(D(p)) admitting exactly K(p) sce- 
narios, and K(p) #0 mod p. Each string in M(D(p)) \ M‘(D(p)) will admit more 
or fewer than K(p) scenarios, and further []jcp,) H(u,%)! =0 mod p. As a result 
~ TL 21)! = |Mr(D(p))|K (py) mod p. 
bEM(D(p)) 1€[m] 
Repeating this construction for sufficiently many primes p < 5max{300,n + 5}, we 
obtain enough congruences, which together with the knowledge that there are at most 
2” satisfying truth assignment for I’, uniquely determine the size of M/.(D(p)) which 
is equal to the number of satisfying truth assignments for I’. 
Later we will see that the main work goes into developing a multiset D(p) with 

the property that  € M',.(D(p)) and yp! € M'(D(p)) \ M‘(D(p)) have 

I] 4G)! 4 I] Av)! 

i€[m] i€[m] 
In Section 2.1.1, we define the strings D(p) which are used in the proofs to distinguish 
M,(D(p)) from M'(D(p)) \ M'-(D(p)). 


2.1.1 ENCODING BOOLEAN CLAUSES IN BINARY STRINGS. 


A truth assignment satisfies I’ if and only if it satisfies every clause in I’. Hence, we 


will encode each clause c; of I’ in a set of 50 strings 
Coe AP Dace UeGh 


which will be defined through Table 2.1. These 50 strings, a subset of D(I’), are 
designed to distinguish those medians in M{,(C;) from those in M’(C;) \ Ml. (Ci). 


Confirmation of this will come in Section 2.1.3. Because every truth assignment 
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which does not satisfy I” does not satisfy some clause in I’, we will see that Wey Ci 
distinguishes between M/, (Wien C:) and M’ (Wie C;) \M* (Wicty Gs). 


The following definition gives a guide for defining a multiset of binary strings. 


Definition 2.4 (Defining strings). For arbitrary m,n € Z* and t € Z°, to define a 
multiset of binary strings {N1, N2,---,%m} on coordinates (211, Y1,---,Ln; Ynys C1, --+5 et); 


it suffices to 
e define nj|x~] and nj [ye] for each j € [m] and £ € [n], and 
e define a function e : [m| > Z*°. 


We say n; has e(j) additional ones. In order to infer the values 1;[{ee] for each j € |m| 


and ¢ € |t], follow this procedure: 


Partition |t| into subsets E, Ey, E2,...,Em so that the size of Ej is precisely 
e(j), and E = [t]\ Ujetmy £3. For each j € [m] and each £ € Ej, set n;[ee] = 1, 


and for j' # j, set nj[e"] = 0. 


Remark 2.5. Let m € Z*. For an arbitrary multiset {nj }ii1 of binary strings built 
using Definition 2.4, for each £ € |t], there is a unique j € [m] such that n,[ec] = 1. 
Consequently, each we M({njfF21) will have pulec] = 0 for all ¢ € [t] because pp must 


minimize DY jetm) A (nj, 1). 


Definition 2.6. Let n € Z*+ andt € Z=° be arbitrary. Two binary strings n and H 
with coordinates (11, Y1,---;Ln; Yn; €1,---,€¢), are said to be complementary on the 


first 2n coordinates if n[x;] = 1—7[xi] and nlyi] = 1—-—7lyi] for each i € [n]. 
The following fact will be useful. 


Fact 2.7. Let and 7 be binary strings on coordinates 


(is Vistas Ba agei aay Ge) 
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Set e(n) = Viey nlei], the number of additional ones inn. Define e(7) similarly. Ifn 


and 7 are complementary on the first 2n coordinates, then for any u € {0,1}?" x {0}, 


A(u,n) + H(u,7) = 2n + e(n) + e(7). 


Proof. For each i € [nl], either pu[x;] = nlx;] or ula;] = Axi], but not both. This is 
also true for each y;. This accounts for the 2n in the sum. Because ju[e;] = 0 for all 
i € |t], each 7 € |[t] with nle;] = 1 will contribute one to the sum. Also each i € [¢] 


with 7e;| will contribute one to the sum. This completes the proof. 7 


Definition 2.8. Given an arbitrary multiset of binary strings S, we say that a co- 
ordinate s is ambiguous if there are exactly si5| binary strings n € S, counted with 
multiplicity, such that n|s] = 0. Consequently, if you change the value of a median at 


an ambiguous coordinate, you obtain another median. 
Fact 2.9. Let S be a multiset of binary strings which are defined on the coordinates 
(it atk peace): 


If S can be partitioned into pairs of vertices where the two strings in a pair are 
complementary on the first 2n coordinates, then each x; and each y; is an ambiguous 


coordinate. 


Fix an arbitrary D3CNF, I’, with n variables and k clauses. Fix a clause ¢; in I’. 


For this clause, we are now ready to define a set of 50 strings 
fee tet nee 


First assume that c; = vg V vg V vy, a disjunction of three positive literals. Because 
I is a D3CNF, we may assume a < (2 < 4. 


For each j € [50], we will supply the following three pieces of information for vi: 
(a) The values for v5[o],V% [Yo], V5 [xa], Vilys], Vi [vy], V;[y_] will be explicitly defined. 
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(b) A constant «jj, € {0,1} will be given so that vila] = vilye] = Ky for all 
ULE fal \ a, By yh: 


(c) The string will be assigned some number of additional ones. 


By Definition 2.4, this is sufficient to explicitly define v\. 

The Table 2.1, there is a row for each string in C;. The three defining pieces of 
information are found in Columns (A), (B), and (C) of Table 2.1. The remainder of 
the table will be explained in Subsection 2.1.2. 

For each j € [50], row j of Table 2.1 supplies the three ingredients needed to 


define vs. By matching the 6-bit string in Column A of row 7 with 


(Yi[ro], ¥jlYal Y;lea], Yjlyal, Mjley], jr) 


we obtain the 6 values for (a). The constant «;; for (b) is found in Column (B) of 
row j. For (c), the number of additional ones in v is found in Column C of row j. 

With a slight modification in the reading of Column A, the 50 rows of Table 2.1 
will also supply the 50 strings for a clause which contains negative literals. Fix an 
arbitrary clause c; in /’ which now may have negative literals. For each j € [50], the 
definition of string vt will again be based on Columns A, B, C of row 7 in Table 2.1. 
The same information will be gleaned from Columns B and C as in the case when c¢; 
had no negative literals. The only difference is with Column A which will be explained 
next. 

Let S; denote the support set of clause c;. If ¢; contains the variables vq, ug, Vy, 
then S; := {%a,Yo,XB,Y8, Ly, yy}. Clause c; must be one of the 8 clauses listed in 
Column A of Table 2.2. For 7 € [50], vi is defined on the coordinates S; by matching 
the entry in the right column of the c; row of Table 2.2 with the 6-bit string in Column 


A of the j*” row of Table 2.1. 
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Table 2.1. The 50 strings in C; for a single clause c; along with their Hamming 
distance from medians in M’. 


A B Cc M1 M2 M3 M4 M5 M6 M7 M8 
Values of | vila], 101010 | 101001 | 1001 10 | 011010 | 100101 | 011001 | 0101 10 | 010101 
Row |v on its vilye) | Add’l 
# | support set | (ve g cq) | Ones 
= 1 0100 00 0 { n+4 n+4 n+4 n+2 n+4 n+2 LES n+2 
£5) 2 00.0100 0 n+4 | n+4 | n+2 | n+4 ] n4+2 | n+4 |] n+2 | n+2 
=| 3 | oo0001 0 43 || nta | nte | nta | nta | nto | nt2 | nse | ne 
S 4 101111 1 t n—-1 n-1 n n+1 n n+1 n+1 n-4 
to) 5 111011 i n-1|n-1]n n-1 | n+ n-1]n+1 | n4 
S 6 111110 il +0 n—1 n+1 n n—-1 n+ n+1 n n+ 
r 101000 0 +2 n n n+2 n+2 wy n+2 n+A4 n+4 
8 1000 10 0 +2 n n+2 n n+2 n+2 n+4 n+2 n+4 
9 001010 0 +2 n n+2 n+2 n n+4 n+2 e+ 2 n+4 
10 101000 0 +2 n n n+2 n+2 n+2 n+2 n+A4 n+4 
11 100001 0 +2 n+2 n n+2 n+A4 n n+2 n+4 n+2 
12 001001 0 +2 n+2 n n+4 n+2 n+2 n n+4 2 
13 100100 0 +2 n+2 n+2 n n+A4 n n+4 n+2 n+2 
14 1000 10 0 +2 n n+2 n w+2 n+2 n+4 n+2 n+4 
= 15 000110 0 +2 5 n+4 n n+2 n+2 n+4 n n+2 
S 16 011000 0 +2 n+2 | n+2 | n+4 n n+4 n n+2 | n+2 
wee) LF 0100 10 0 +2 n+2 n+4 2 n n+4 n+2 n n+2 
a 18 001010 0 +2 n n+2 n+2 n n+4 n+2 n+2 n+4 
19 100100 0 +2 n+2 n+2 n n+4 n n+4 n+2 n+2 
20 10000 0 +2 WE? n n+2 n+A4 n n+2 n+4 nm+2 
21 00010 0 aa n+4 n+4 n+2 n+A4 n n+2 n+2 n 
22 011000 0 +2 hee 2 ED n+4 n n+4 n n+2 nee 
23 01000 0 2 n+4 n+2 n+4 Kg n+2 n n+2 n 
24 00 100 0 +2 AD n n+4 nmo?Z aaa | n n+A4 hae 
25 010100 0 2 n+4 n+4 n+2 WAZ n+2 n+2 n n 
26 010010 0 +2 n+2 n+4 n+2 n n+4 n+2 n n+2 
af 00 01 10 0 +2 WED n+4 n ree. n+2 n+A4 n n+2 
28 1010 1 + n—-1 n— n+ n+1 n- n+1 n+3 r+ 
29 10110 1 { n- n n n+3 n n+1 n+3 n+ 
30 11100 1 f n- n n+3 wd n+ n—-1 n+3 n+ 
31 100 1 { n- n n n+3 n n+3 n+1 n+ 
32 101110 1 { n—-1 n+ n n+1 n n+3 n+1 n+3 
on 110110 1 + n- n+3 n mand n- n+3 n-1 n+ 
34 0110 1 + ‘oe fee n+3 n—-1 we o n n+1 nt 
35 011110 1 + n+ n+3 n+ n—-1 n+3 n+ n—-1 n+ 
= 36 111010 1 + n—-1 n+ n+ n—-1 n+3 n+ n+1 ES 
E an 100 1 t n- n n n+3 n n+3 n+1 n+ 
| 38 10110 1 + n+ n— n+ n+3 n— n+ n+3 | n+ 
2 39 11010 1 + nes n+ n+ mos n- n+ n+1 n- 
we 40 0110 1 + n+ n+ nt+3 n—-1 n+3 n n+1 n+ 
Al 01110 1 + has ab n+3 pat n- n n+1 n 
42 11100 1 + n+ n— n+3 wed n+ n n+3 n-4 
43 010 1 + nes naps n n+1 n+ n+ n—-1 n- 
44 011110 1 { n- n+3 n n—-1 n+3 n n—-1 nt 
45 110110 1 + nt n+3 n n+1 n- n+3 n—-1 n+ 
46 010 1 f n+3 n+3 n n+1 n- n n—-1 n 
47 01110 1 t n+3 n+1 n+3 n+1 n- n n+1 n 
48 11010 1 + n+3 n+1 n n+3 n n n+1 n 
Xe] 49 01010 0 + n+3 n+2 n+2 n+A4 n n n n—2 
tg 50 101010 1 eae n—1 np I n+ n+1 n+3 3 n+3 n-3 
For a clause c;, the left three columns define the 50 strings in C;. In row j, the 6-bit string gives the values of 


us on the support set S; as described by Table 2.2. The second column gives the constant value to be assigned 


to all xe and 
{01,10} is 


ye which are not in S;. The third column specifies the number of extra ones in vi The collection 


isted along the top row. The entry in row j and column £ is the number of additional ones in v% 


added to the Hamming distance between the 6-bit string in row j and the 6-bit string at the top of column c 


Ze 


Example 2.10. For an example, when c; = Va V Ug V Vy, the last row of Table 2.1 


says that the string vig must have 


CALAR ViolYal: Viol yal, Vio\ra]; ViglYy]; Vsq|24]) = (101010). 


Therefore, Vio \Lal =i, Veo (Ya = 0, Vio|ag] =O, Veolyal = 4, V5q || = 0, and 
Viglyy| = 1. Further, Column B implies vij(xe) = vi,(ye) = 1 for all £ € [n]\{a, 6, y} 


and, from Column C, vig will have 2 additional ones. 


Now that we have defined C; for any clause c;, let us analyze M(C;). By Fact 2.5, 
for every u € M(C;) and £ € [t], ulec] = 0. 

In Column B of Table 2.1, it is evident that for any @ € [n] \{a, 6, y}, the number 
of strings vt with vt [ze] = 0 is 25 = 5ICil. Therefore, by Definition 2.8, the coordinates 
xp and ye are ambiguous. Through careful inspection of the strings in Column A of 
Table 2.1, we see that coordinates x/ and y/, are also ambiguous for each ¢’ € {a, 3,7}. 


Therefore we have proven the following fact, which was one of our goals: 


Fact 2.11. For an arbitrary clause c; with three distinct variables, 


M(C;) = {0,1}°" x {0}. 


Remark 2.12. By visual inspection of Table 2.1, the binary strings C; can be parti- 
tioned into pairs where the two strings in a pair are complementary on the first 2n 


coordinates. 


2.1.2 HAMMING DISTANCES BETWEEN C; AND POSSIBLE MEDIANS 


Here we explain the remainder of Table 2.1. Fix a clause c¢; in I!’ which will be used 


throughout this subsection. Suppose c; has variables va, vg, and vy. By Fact 2.11, 
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Table 2.2. A key for interpreting Column A of Table 2.1. 


Clause Key to interpret Column A of Table 2.1 
Vo. V Ug V Vy (Vj La], Vj Ya], Yj aa], V; ya], Yj a4, V5[y]) 
Va V vg V vy | (Yi[Yal, Yi[ta], 4{lzs], Yil¥e], Yl], 4; [y4]) 
Ug V UB V Vy (Vj Lo], Yel, Yilyal, ileal, Yj ry], V5 [Y4]) 
Va V Ug V Uy (Vj Lo), Yel, Yjlza], Yilyal, Yj Yq), ¥5[@4]) 
Ta V UBV % | (Y4[Ya), ¥4[ta], Mls), H4lz8], H§ 04], Hj ly) 
TeV ug VE, | (lvalsfzal-r4lza].vfusl v4len]-v4 les) 
Va V Up V Uy (VYiltal, Yilyal, 4ilya), 4ilea], Yjlyr], Y{lz1]) 
TaV BVT, | (vilyal, vival, v'lysl, vilea),vily>),v' le) 


For any clause in the left column, the corresponding entry 
in the right column above will be matched with the 6-bit 
string in Column A of row j of Table 2.1 to determine the 
value of v at each bit in the support set 5;. 


M(C;) = {0,1}? x {0}'. Therefore, M’(C;), from Definition 1.9, must be equal to 
{01,10}”" x {0}*. For this subsection, define 


M := M(G)), M' := M'(C)). 


Define an equivalence relation ~; on M’ such that two medians are equivalent 
if they agree on the coordinates in the support set S; of c;. The result will be 8 
equivalence classes because p[x,] 4 lye] for each € € {a, 6, y} for each pw € M’. 

Here we define a one-to-one correspondence between the equivalence classes of M’ 


under ~; and the 6-bit strings heading Columns M1 through M8 in Table 2.1. 


Definition 2.13. Fix a clause c; and an integer € € [8]. Consider the 6-bit string 
6 which heads column Mé. In Table 2.2, locate the tuple in the right column corre- 
sponding to our fixed clause c;. After replacing each vt with pp in the tuple, match 


this tuple with 6. This gives six values that a median wp € M' must have if it is in the 


equivalence class represented by the column heading 6. 


In Definition 2.1, we defined a correspondence between M’ and truth assignments 
for I’. In Definition 2.3, we introduced the notation M‘(C;) for the collection of 


medians in M’ which correspond to satisfying truth assignments for I’. Similarly, we 
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defined M{, (C;) for each clause c; in I’. For the remainder of this subsection, set 
r= Mr(Ci), M., = M.(Ci): 


The following claim uses the correspondence in Definition 2.13 to connect M’ \ M‘, 


with a particular equivalence class. 


Claim 2.14. Let c; be a clause in I’. For any p € M’, ys is in the equivalence class 


represented by Column M8 of Table 2.1 if and only if we M'\ Mi. 


Proof. Fix a clause c; with variables vq, ug, vy. This clause may have some negative 
literals. We focus our attention on vg. The arguments for vg and v, are exactly the 
same. 

There are two cases depending on whether vg appears as a positive literal or a 
negative literal in c;. 

In the case where v,, appears in c; as a positive literal, the truth assignment which 
makes c; false assigns a value of false to va. A corresponding median pp € M’ has 
[vq] = 0 and ply.| = 1. Because vg appears as a positive literal in c;, the entry 
in the second column of Table 2.2 has ju[x_| followed by ju/y.]. So, in this case, the 
6-bit string which heads the column for medians in M’'\ M{, has 01 in the first two 
entries. 

In the case where vg appears as a negative literal in c;, the non-satisfying truth 
assignments for c; must have vg true. The corresponding medians pp € M’ will have 
[vq] = 1 and ply.| = 0. For the clauses with variable v, appearing as a negative 
literal in c;, a quick glance at Table 2.2 reveals that ju/y.| immediately precedes pu[xaq| 
in the 6-bit column headings in Table 2.1. As a result, the column representing 
medians in M' \ M(, has 01 in the first two entries. 

Repeating this argument for vg and v,, we see that medians in M’ \ M{, are 


represented by the column with heading 010101. a 
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Now that we have defined the rows and columns of Table 2.1, we conclude this 
subsection by defining the entries within Table 2.1 for fixed clause c;. 

Let . € M’ be an arbitrary median that falls into the equivalence class represented 
by Column Mé for some ¢ € [8]. The entry aj; in Row j and Column M6 of Table 2.1 


is H(u, vi). This value can be calculated as follows: 


e First, take the Hamming distance between the 6-bit string in Column A of 
Row j and the 6-bit string in the header of Column Mé. This is equal to the 
Hamming distance between the restrictions of js and v to the support set 5S; 


for c;. 


e For any s ¢ {a, 3,7}, ulrs] A ulys| and vi|x.] = v'[ys]. Therefore the Hamming 


distance between (y[x5], u[ys]) and (v5[xs], vi[ys]) is 1 for each s € [n]\ {a, 8, 7}. 


e Finally, because ju[e,] = 0 for all s € [t], the Hamming distance between the 
restrictions of jz and vt to the coordinates (€1, €2,..., e;) is the number of addi- 


tional ones in vt which is found in Column C of Row 7. 


Adding these three values together gives the entry aye. 


2.1.3. DISTINGUISHING THE SATISFYING TRUTH ASSIGNMENTS 


Fix a clause c; in arbitrary D3CNF I. For this subsection, we again set M’ := M’'(C;), 
Mr := MP(C;), and M!, = M{, (C;). For each p € M'\ M{,, is in the equivalence 
class represented by 010101 according to Claim 2.14. Then reading the entries in 
Column M8 of Table 2.1, we find 


{H(y, vi): 7 € [50]} = {(n — 2)ay, (n — 1)@, N@), (2 + Laas), 


(n + 2) as), (m + 3)(3), (n + 4)@), (n + 5)ayf- (2.1) 
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/ 
C. 


Otherwise, for each median p € M/., 1 is in one of 7 equivalence classes represented 


in Columns M1 through M7. The entries in each of these columns yields 


{H (p, Vv‘) 17 € [50] } = {(n _ 1) (7), 26); (n + 1) (12), (n + 2) (12), (n + 3)(6); (n + A)(7y}. 
(2.2) 


Therefore,we can use C; to distinguish between the medians in M/, and the me- 
dians in M’\ M/,. For example, given p € M’ = {01,10}”" x {0}’, if we determine 
that (n +5) € {H(u, v5) : 7 € [50]}, then we can conclude pp € M'\ My... 

Now we wish to consider all of the C; multisets together. It is clear that each 
x; and each y; coordinates will remain ambiguous in the multiset Wj) Ci. For the 
additional ones, we will take t large enough to maintain the property that, for each 
i € {t], there is at most one binary string 7 in Wieqj Ci with nle;)] = 1. As a result, 

M (y 7 S401 se TOM, 
i€[k] 
Further, 


Mi, = Me (Gi) = Mi, (y i 


i€[k] 


ri= MPC) = Mr (y | 


i€[k] 


By definition of the sets M/,, and M’, 


r= (\M,, (2.3) 
i€ [k] 
M\ Mp =M'\ 1 Mi = UM \ Mi, (2.4) 
i€ [k] i€[k] 


Therefore the multiset Weg) C; will serve as a tool to distinguish M‘, from M’\ M*. 


yas 


2.2 COMPLEXITY RESULT FOR #STARSPSCJ 


Before stating Theorem 2.18, we need a result which is equivalent to the Prime 


Number Theorem. Define 


A(x) := S > logp. 


psa 
p prime 


Theorem 2.15. 6(x) ~ a. 
As a result, the next lemma and corollary hold. 


Lemma 2.16 ( Rosser (1941) ). For 2 <2, 


( aa) 2 < 62) < (14 se 


7 log x 


Corollary 2.17. For any n > 300, 


psn 
p prime 


Now we can prove the main result for this chapter. 
Theorem 2.18. #StarSPSCJ is #P-complete. 


Proof. We have already verified that #StarSPSCJ, which is a subproblem of #SPSCJ, 
is in #P in Lemma 1.23. To show #P-complete, we give a polynomial time reduction 
from ##D3SAT. Fix an arbitrary D3CNF I = cy Aco A... A cy where each c; is a 
clause and I" has n variables. 

Using the bound in Corollary 2.17, let n’ = max{300,n+5}. Fix a prime number 


p which is greater than n’ and at most 5n’. Let 


q:=p—(n+5). 
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We will explicitly define a multiset D(p) = A(p) U Use Bi(p) U Uiepy Ci(p) con- 


sisting of 2 + 2n + 50k binary strings with coordinates 


(Las Vaasa Ua; Cid 4 CH) 
where 
t(p) := 2(q + 4) + 2n(q + 3) + k(75 + 509). (2.5) 
The coordinates €1, €2,...,€¢(p) are for the additional ones. In order to define each 


n € D(p), we will give exact values for n|x;| and n[y;| for each 7 € [n] and specify 
the number of additional ones that 7 will have. Definition 2.4 tells how to obtain the 
values of nle;| for each j € [t(p)] from this information. 

All strings in D(p) will come in pairs which are complementary on the first 2n 
entries (Definition 2.8). As a result, we can use Fact 2.9 to see that each of the first 
2n coordinates are ambiguous in D(p). 

Now we begin defining the strings in multiset that together create D(p). The set 
A(p) consists of two strings, a and @. Define a to have alx;] = aly] = 1 for all 
i € [n] and q+4 additional ones. Define a to be complementary to a on the first 2n 
entries and have q+ 4 additional ones. 

For each j € [n], the set B;(p) will consist of two strings, 3; and 3;. Define {; to be 
the string with 6;[x;] = 8;[y;] = 1 and for all 7’ € [n] with 7’ 4 J, B;[zj] = Bjly;] = 0 
and q+3 additional ones. Define 3; to be complementary to 3; on the first 2n entries 
and have q+ 3 additional ones. 

For each 7 € [k], the set C;(p) will have 50 strings. These are obtained by adding 
q more additional ones to the 50 strings in C; which were defined through Table 2.1 
(see Section 2.1.1). In other words, increase each entry in Column C of Table 2.1 by 
q to obtain C;(p). 


In summary, we have constructed the strings 


D(p) = Alp) U U Bilp) U U Ci). 


iE [n] t€[k] 
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As described in Definitions 1.10 and 2.3 and for each clause c; in I’, set 


M(p) = M(D(p)), M'(p) = M"(D(p)), 
M.,(p) = Mz,(D(p)), M((p) = Mr(D(p)). 
As stated in Fact 2.5, each w € M(p) has pi[e;] = 0 for all 7 € [t(p)]. Additionally, 
because all of the strings in D(p) come in complementary pairs, the coordinates x; 
and y; are ambiguous for each j € [n] (Fact 2.9). Thus there are 2?” medians ju. More 
precisely, 
M(p) = {0,1}?" x {0} and (2.6) 
M'(p) = {01,10}" x {O}). 
Define 
H(u,A(p)) = |] A(u,a)! 


acA(p) 
and likewise define H(ju, B;(p)) and H(ju, C;(p)) for each j € [n] andi € [k]. Therefore 


the number of SCJ scenarios admitted by z (Definition 1.8) can be expressed by 


H(u) = Hw, A(p))- TT He, Bilp)) - TT HG, Ci(p)). 


iE [n] i€[k] 


At this point, we wish to calculate d(j) mod p for each median pp € M(p). To 
analyze H(1) for each yp € M, we define the following 3 properties that a median 


pu € M(p) may have. 
Property 1. Vietn(ulvi] + uly) =n. 
Property 2. w € M'(p). 
Property 3. uw € M',(p). 


First notice that these properties are nested. Any w € M(p) with Property 2 
must also have Property 1. Likewise, if 2 has Property 3, it will also have Property 
2. The next 4 claims divide M(p) into 4 classes and examine H(jw) for medians in 


each class. 
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Claim 2.19. For arbitrary  € M(p), if 4 does not have Property 1, and consequently 


does not have Property 2 or 3, then H(z) =O mod p. 


Proof. Let ys be an arbitrary median in M(p). For a € A(p), Fact 2.7 gives 


A (pu, a) + H(p,@) = 2n+ (q+ 4) + (q+ 4) = 2p — 2. 


Hence, there is an integer r such that q+4<r<2n+q+4 and A(y,a) =r with 


H(u, A(p)) = ri(2p — 2-1). 


Since 44 does not have Property 1, we can conclude that exactly one of the following 


holds: 


A(p,a) > (n+1)+(¢+4) =p 


A(u,a) > (n+1)+(q+4) =p. 


Therefore, either r > p or (2p —2—r) > p. In the first case, r! is divisible by p and, 
in the second, (2p — 2 —r)! is divisible by p. Therefore H(u,A(p)) = 0 mod p and 


consequently (jw) = 0 mod p. = 


Claim 2.20. For an arbitrary » € M(p), if « has Property 1, but does not have 


Property 2, then H(j) = 0 mod p. 


Proof. Suppose  € M(p) \ M'(p) but uw has Property 1. Because w ¢ M'(p), 
there is an integer jo € [n] such that plx,,| = uly,j|. In the case when pu[x,;,] = 0, 
we have H(p, 8;,) = (n+ 2)+ (q+ 3) = p. Otherwise p[x;] = 1 which implies 


H(p, Bj) = (n + 2) + (¢ +3) = p. In either case, 
H(, Bj.) = pl(p — 4)! 
and consequently H(z) = 0 mod p. = 


Claim 2.21. For an arbitrary w € M(p), if w has Properties 1 and 2, but does not 


have Property 3, then H(w) =O mod p. 


dl 


Proof. Let pp be in M'(p) \ M',(p). Since yw corresponds to a truth assignment which 
does not satisfy I’, there is a clause c;, in J” which is not satisfied by this truth 
assignment. Therefore pp € M’(p) Ms (p). By (2.1), before adding the q additional 


ones to each string from C;,, we have 


{H(u, v7?) vi? € Cig} = {(n — 2)ay, (2 — 1)@), NG), (N+ Las); 


(n + 2) (15), (n + 3)(3); (n + 4) 6), (n + 5) 1) }- (2.7) 


To create C;,(p), we added gq additional ones to each string in C;, which increased 


each Hamming distance by g. Therefore 
{H(u, vj?) : v7? € Cio(p)} = {(v — Ta); (P — 8); (P — 5)(a), (P — 4)as), 
(p — 3) qs), (P — 2)(3), (Pp — DY@, Pay}: 
As a result, 

H(}1,Ci(p)) = (p — 7)!(p — 6)!°(p — 5)!P(p — 4)1°(p — 3)P°(p — 2)!°(p — 1) Mo 
which is divisible by p. Therefore H(j1) = 0 mod p. = 
Claim 2.22. For an arbitrary uw € M(p) having Properties 1, 2, and 3, the value 

H(u) = (p — 6)!"*(p — 5)!" (p — 4)17*(p — 8)1*(p — 2)! F29(p — 1H, 
which is not congruent to 0 modulo p. 
Proof. Let  € M',(p). Because it has Property 1, 
H(p, A(p)) = (n + (q+ 4))!? = (p— 1). 
Since ys has Property 2, for any 7 € |[nJ, 
H(u, Bi(p)) = (n+ (+ 8))P = (p—2)P. 
Finally, 2 satisfies Property 3 which means yp € M’, (p) for all clauses ¢; in I’. 
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Recall that each string 7 € C;(p) is created from a string 7 € C; by adding g more 
additional ones. Therefore H(u,7) = H(u,7') +q. So, the multiset H(w,C;(p)) can 


be obtained from H(,C;) found in (2.2) by adding q to each element. As a result, 
H(u,Ci(p)) = (p — 6)!"(p — 5)!°(p — 4)! (p — 8)! (p — 2)! (p — 1)!" 
Therefore 
H() = (p — 6)!"*(p — 5)!°* (p — 4)17*(p — 3) 78 (p — 2)1* 42" (yp — 1)? (2.8) 
Because p is prime, H(j) #0 mod p. = 


Set 
T(p) = D> Hw). 


HEM (p) 


Set S(p) equal to the function of p displayed in (2.8). Thus Sp) is precisely the value 
of the number of SCJ scenarios admitted by an arbitrary wp € M‘(p). If we calculate 
T(p) mod p, the four claims show that 

T(ipP)= DY Hw) = |Mp(p)|-S(p)_ mod p. (2.9) 

HEM‘ (p) 
If y is the number of satisfying truth assignments for I, then y = |M',(p)| by 
Definition 2.3. Therefore 
7: S(p) =T(p) mod p. 

Since p does not divide S(p) (Claim 2.22), there exists an integer S’(p) such that 
S(p)- S'(p) = 1 mod p. Thus 


y = S'(p)-T(p) mod p. 


While this alone is not sufficient to determine the value of 7, we can repeat this 


construction for many different prime values to obtain more congruences. 
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Recall p was fixed to be a prime greater than n’ and at most 5n’. Repeat the 
above construction for each prime pj, p2,...,Pm in this range. The result is a list of 
congruences: 

y= S"(p1)-T(pi) mod pi, 


Y= S"(p2) -T(p2) mod po, 


1 = S"(pm)-T (Pm) mod pm. 


Because p1, p2,---,Pm are all prime, the Chinese Remainder Theorem guarantees 


a solution for y which is unique modulo []j¢fmj pi. By the Corollary 2.17, 


Since y is the number of satisfying truth assignments for J’, and there are only n 
literals which can realize one of two values, y < 2". Since [Tie(mjpi = €" > 2" > 7, 
the Chinese Remainder Theorem gives the exact value of 7. 

In summary, for D3CNF I’ with n variables and k clauses, we use the Sieve of 
Eratosthenes to identify the primes between n’ and 5n’. This runs in O(n?) time. 
Then for each prime p in this interval (which is at most max{2n,600} primes), we 
create 50k +2n-+ 2 binary strings of length 2n + t(p) where t(p) is a polynomial (2.5) 
in n and p with p € O(n). Finally, the Chinese Remainder Theorem will solve the 
system of congruences in O(log(p;p2...Pm))* time (Bach and Shallit 1996). For us, 
this is O(n log n)? because each prime is at most 5n and m < 2n. 

Therefore, if we had a polynomial time algorithm to determine the total number 
of most parsimonious scenarios for a collection of binary strings, then we have created 


here a polynomial time algorithm to determine the number of satisfying truth assign- 
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ments for a D3CNF, a problem which is known to be #P-complete. This finishes the 


proof. : 


2.3. TORPIDLY MIXING MARKOV CHAIN 


In the previous section, we proved that #StarSPSCJ is a #P-complete problem. 
The natural next question is whether or not the number of most parsimonious SCJ 
scenarios can be approximated. More important, can we sample from the most par- 
simonious SCJ scenarios almost uniformly? In this way, one can test hypotheses on 
a sample random sample since there are too many most parsimonious SCJ scenarios 


to test hypotheses on all of them. 


Definition 2.23. A counting problem #A in #P has an FPAUS (fully polynomial 
almost uniform sampler) if there is a randomized algorithm such that, for any instance 
of #A and any € > 0, the algorithm outputs an element x € X, the solution space for 


#A, with probability p(x) where 


where U is the uniform distribution on X and the algorithm runs in time polynomial 


in the size of the instance of #A and — loge. 


The technique which was used to prove that #StarSPSCJ is in ##P-complete has 
been used to show that other problems are #P-complete. For example, Brightwell 
and Winkler (1991) used this technique to prove that counting the number of linear 
extensions of a partially order set is ##P-complete. For this same problem, Karzanov 
and Khachiyan (1991) found a rapidly mixing Markov chain to sample the linear ex- 
tensions. This would suggest that #StarSPSCJ also has an FPAUS. However, here we 
give a straightforward Markov chain to sample the most parsimonious SCJ scenarios 


that turns out to be torpidly mixing, suggesting that #StarSPSCJ may not have an 
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FPAUS. With evidence for both the positive answer and the negative answer, the 
question of whether or not #StarSPSPCJ has an FPAUS remains open. 

The Markov chain that we define here is stated for the more general #SPSCJ 
problem on the star tree 7’ where the multiset of genomes labeling the leaves is not 
required to have independent adjacencies. Then we prove that this Markov chain is 


torpidly mixing for a class of instances in which the adjacencies are independent. 


2.3.1 DEFINING THE MARKOV CHAIN 


Fix a multiset G of genomes to label the leaves of star tree JT. Consider the set 
of median genomes that could label the center of the star in a most parsimonious 
labeling. 

Note that if G has an odd number of strings, then there is exactly one median. 
Here we assume that the size of G is even. 

Given the binary string representations {1,V2,...,U%m} for the genomes in G, 
a median ps must minimize Dp.) H(i, uw). Each majority adjacency, an adjacency 
which appears in more than half of the genomes in G, must also appear in every 
median. Because these appear in more than half of the genomes, for every pair of 
majority adjacencies, there is a genome in G that contains both of them. Therefore, 
containing all majority adjacencies does not conflict with the requirement that a 
median is a valid genome. If an adjacency appears in fewer than half of the genomes 
in G, then it must not appear in any median. 

Among the ambiguous adjacencies, adjacencies that appear in exactly half of the 
genomes in G, medians may contain any subset of these as long as the result is a valid 
genome. Each median is characterized by its ambiguous adjacencies. If we draw the 
adjacency graph A with vertices for the extremities and edges just for the ambiguous 
adjacencies for G, then the medians are in one-to-one correspondence with the subsets 


of edges that form a matching (not necessarily maximal and possibly empty). 
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Here we define a primer Markov chain, P, to transition between the medians. As 
mentioned, the medians are in one-to-one correspondence with the matchings of A. 
So it suffices to define our Markov Chain on the state space of matchings in A. 

Define the Markov chain P. From any matching My C F(A) (corresponding to 


median Mo), we may transition to another matching with the following probabilities: 
e With probability 1/2, remain in the current state. 
e With probability 1/2, randomly and uniformly choose an edge from A. 


— If the edge is already in matching Mo, then remove it. 
(i.e. If the adjacency is in median Mo, then cut it, replacing the adjacency 


with two telomeres. ) 


— If the edge is not in matching Mp and adding the edge will extend Mp to 
a larger matching in A, then include it. 
(i.e. If the join of two telomeres of Mo will create the selected adjacency, 


then join them.) 


— Otherwise, do nothing. 


Because we remain at the current state with probability x, by definition we have 


created a lazy Markov chain. 
Observation 2.24. The primer Markov chain P ts irreducible and aperiodic. 


Proof. Every matching M can be reached from every other matching N by removing 
all edges from N one at a time and then adding the edges of M one at a time. Each 
step in this process is completed with positive probability. By definition, the primer 
Markov chain is irreducible. 

Because we have a lazy Markov chain, the probability that we remain at the 
current state is at least 1/2 and thus the period is 1 for every state. Consequently P 


is an aperiodic Markov chain. : 
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For two genomes, G and M, the number of different SCJ scenarios that transform 
M into G is notated by S(M,G). In particular $(M,G) is at most H(M,G)! where 
M and G are the binary string representations of M and G respectively. The factorial 
bound is achieved precisely when {M,G} has independent adjacencies. For a fixed 
multiset of genomes G = {G1,Go,...,Gm} and median M, the number of scenarios 


admitted by this median is defined by the function f as 


f(M) = [8.99 


i=1 


With this new function and the primer Markov chain P, we employ the Metropolis- 
Hastings algorithm (Metropolis, Rosenbluth, Rosenbluth, Teller, and Teller 1953) to 
obtain a secondary Markov chain C’ with a desired limit distribution. The states 
remain the same, but the transition probabilities are changed in the following way. 
From state M, we propose a next state M,., which is reachable from M in one 
step as described in the state space for P. If Mney is different from M, accept this 
transition with probability 


I(M) 
In other words, if Mnew was reached from M with probability P(Mnew|M), then in 


min {1 f{Mnew) : 


the secondary Markov chain C’ the transition from M to M,., will be made with 
probability 


CM I= PO ein {1 ase) | 


f(M) 


Given the function f on the medians for a given collection of genomes, we obtain a 
probability distribution @ on these medians by taking 0(M) to be directly proportional 
to f(M). In other words, 


60(M) x f(M) (2.10) 
or 0(M) =kf(M) for some constant k and any median M. 


Observation 2.25. Markov chain C is reversible and converges to the limit distri- 


bution @. 
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Proof. A Markov chain is reversible if 
O(Mi)C(Mj|Mi) = A(Mj)C(Mi|M5) 


where C'(M,|M,;) is the probability that state M, is reached from state M; in one 
step. This is trivially true if M; and M, cannot reach one another in one transition 


step. Otherwise, there is a single edge that can be added or removed from M; to 


obtain M,. Without loss of generality, we assume f we) <1< 1(Ms) Observe the 


(M5) F(Mi) 
following: 
1 1 : i) 
UM) OM | Ma) BEM a) 5 aan {t f(M,) 
i 
a f(Ma) fade 
= Brm,)! i) eG] 


Therefore the Markov chain C’ is reversible. 

As a result of reversibility, we can quickly confirm that 6C’ = @ by the properties 
of matrix multiplication. Therefore 6 is a stationary distribution. 

Notice in Markov chain C' we can reach any state from any other state. Therefore 
it is irreducible. In addition, it is finite. Because C' is lazy, it is aperiodic. These 
properties, together with the fact that a stationary distribution exists, imply that C 


has a limiting distribution which is precisely the stationary distribution. rT 


Therefore, we have a Markov chain on the state space of medians which, in the 
limit, will sample medians with distribution proportional to the number of scenarios 
each median admits. Once we have a median, it is easy to uniformly sample from the 


scenarios that it admits. 
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Now we will show that the Markov chain C is torpidly mixing (not rapidly mixing). 
In fact, it is torpidly mixing even when the number of genomes is fixed while the 
number of genes and adjacencies is allowed to grow. To prove this result, we will use 
the following: 
For any nonempty subset S of the set of medians M, the capacity of S is 
(8) = So x(q) 
ues 


and the ergodic flow out of S' is 


The conductance is 


pcan) § 1 
6: min] HS sM,0<a(5) <5}. 


Theorem 2.26 (Mélykiti (2006)). A Markov chain is rapidly mixing if and only if 


o> a for some polynomial p(n) which is not identically zero. 


Consider the following example. Select genomes Gp and G; which are represented 
by matchings Gp and G, in the adjacency graph A. Further, assume that {Go, G1} 
have independent adjacencies. More specifically, assume Go is represented by the 
empty matching and G, is represented by a maximum matching on A. Therefore 
each median for {Go, Gi} will be characterized by a subset of the adjacencies in G. 

In their binary string representations, it suffices to only consider the coordinates 
that represent adjacencies in G, since these are the only adjacencies of interest. There- 
fore, we may view Gp as a binary string with all zeros, {0}”", and G, as the binary 
string of all ones, {1}”". The set of medians will be {0, 1}". 

Consider the star tree with 2t¢ leaves, t of which are labeled Gp and the other t are 


labeled G,. First observe that every binary string in {0, 1}” is still a median. Further, 
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a median ps with exactly k ones admits (k!(n — k)!)' scenarios. The total number of 


scenarios, added over all medians, is T := )\7_5 () (k!(n — k)!)*. Therefore 


(4) = __ 


Suppose n is odd. Consider the subset S which contains all medians with at most 


[2 | ones. For this subset, the capacity is 


F(S) 


fl 
M 
pak 
se 
Q 
= 
= 


~ In (ni)? (n \&4 
"(ED 

2 1 1 

= Dre ely? 

This implies 
F(S) 1 
P< < 

eo eo) a i 00019 


Therefore, if t > 1, then as n grows, we see that @ cannot be lower-bounded by a 
function of the form aay where p is a polynomial in n. Therefore the Markov chain 


C is torpidly mixing by Theorem 2.26. 
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CHAPTER 3 


GENERALIZATIONS FOR THE STAR TREE 


In this chapter, we consider a generalization of #StarSPSCJ. 


First, fix a continuous function f : R — R. Then define the following problem: 


Definition 3.1 (#StarSPSCJ(f)). Given an arbitrary m € Z*, let S = {y,}™, be an 


arbitrary multiset of binary strings. Determine the value of 


1s II (AY, 1). 


BLEM(S') i€[m] 


3.1 CALCULATING #STARSPSCJ(f) EXACTLY 


In the previous section, we showed that #StarSPSCJ(f) is #P-complete when f(z) 
is the function x!. Here we work toward the determining the computational complex- 
ity of #StarSPSCJ(f) for various functions f. First, we formalize a definition and 


develop a couple of tools. 


Definition 3.2. A function g: R > R its strictly concave up if for any x,y,z € R, 


HOR WZ, 
g(z) — g(2) 


sae 


Equivalently, g'(x) is a strictly increasing function. 


Lemma 3.3. /f log f(x) is a strictly concave up function, then for any x < y and 


a>0O, 
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Proof. By the intermediate value theorem, there are real values c,d with c € (x—a, x) 


and d € (y,y +a) such that 


(log f)'(c) = 
(log f)'(d) = 


(log f(x) — log f(x —a)) and 


(log f(y + a) — log f(y)). 


SlrRalrR 


Because log f(x) is strictly increasing, g'(c) < g’(d). Therefore, 


“(log f (2) — log f(x —a)) < “(log f(y +a) — log f(y)) 


log f(x) — log f(x — a) < log f(y + a) — log f(y) 
f(z) fly +a) 


< log 


f(x —a) fy) 
f(@) fyt@ 


< 


f(x —a) f(y) 


log 


Fact 3.4. Fir k € Z*°. Let f(x) be a function such that log f(x) is strictly concave 


amin, af) =f ({5]) j ({5]) | 


a+b=k 


up. Then 


Proof. Let x = |E| and y = [§]. By Lemma 3.3, f(x —a)f(y+a) < f(x) f(y) which 


2 


gives the desired result. : 


Theorem 3.5. Fir a function f(x) : Z=° + R=° which satisfies the following prop- 


erties: 
e log f(x) is strictly concave up, 


e the function values of f can be computed in polynomial time, and 
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e for all but finitely many n € Z, n > 2, 


f(n = 2)[f(n + IPF (n+ 2)P fm + 5) 


For arbitrary m,s € Z*° and D ER, let S := {1%4,%,...,U¥m} be a multiset of binary 
strings, each of length s. Then it is #P-complete to determine how many medians [u 


for S have 


I] f(A) < D. (3.1) 


Proof. Fix a function f(x) with the properties listed in the theorem. 

If is straightforward to see that #StarSPSCJ(f) is in #P. Fix an instance consist- 
ing of integer m, s, real number D, and a multiset S of binary strings of length @. Let 
i be a binary string of the same length as each v;. We can verify that y is a median 
in time O(mé). Each H(v;,) can be computed in time O(/). Because H(1j;, w) < &, 
we can compute f(H(v;, 4) in time polynomial in the size of the input by the condi- 
tions on f. Finally, checking if the product is at most D is also a polynomial time 
calculation. Therefore #StarSPSCJ(f) is in #P. 

To prove NP-hardness, we will provide a reduction from #D3SAT. For I’, a 


D3CNF with n variables and k clauses, set 


K(n) =[f (n)]*[f(n ite 19 lead ce a Deere? 


[f(a + 3) [fn + YPM (n + 5)" 


The idea is to define a multiset, D, of binary strings with the following properties: 


e Each median which corresponds to a satisfying truth assignment for I” will have 


II f(4(7, 0) = 


nED 
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e Each other median will have 


I] f(A, #) > &. 


nED 


For arbitrary n,k € 27°, fix [ =c; Aco A... Ac, be an arbitrary D3CNF with 
n variables. 


Create a total of 158k + 28kn strings of length 2n + 260k + 35kn with coordinates 


(x1, Y1, ©2, Y2,-++5Un; Yn, €1, €2,--- ,€t) 


where t = 260k + 35kn. 

This multiset of binary strings will be defined as the union of three multisets: 

D=A¥ WY Bw YC. 
i€[n] i€[k] 

As in Definition 2.4, we will define each string 7 € D by explicitly giving the values 
of n[x;| and n[y;] for each i € [n] and telling the number of additional ones in 7. 

The collection A contains 108k strings. For a € [t], let a be the string with 
alxz;] = aly;| = 1 for all 1 <i < n and a additional ones. Define at” to be the 
binary string which is complementary to a‘+°) on the first 2n coordinates and has a 


additional ones. The multiset A will consist of the following strings: 
e k copies each of at and a), 
e 8k copies each of a(t) and a), 


18k copies each of a‘+?) and a), 


18k copies each of a‘) and a), 
e 8k copies each of at and a, 


e k copies each of at) and a), 
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The collection B = Wicjn) 8B; contains 28kn strings. For each i € |n], a € [#], let 


8° be the string with blz, = Bly] = 1, Bilz;] = Bily;| = 0 for 7 #4; and with 


a additional ones. Define the binary string Bt to be complementary to Bo) on 


a 


the first 2n coordinates and have a additional ones. The collection 6; consists of the 


following 28h strings. 


e k copies each of poy and Be, 


e 6k copies each of Bt) and Bt 


(+3 
Bi ) 


a ’ 


e 6k copies each of Bi) and 
e & copies each of Bs) and hase 


The collection C = Wie) C; contains 50k strings. Each set C;, which is associated 
with clause c;, consists of 50 strings. In Section 2.1.1, we defined set C; through 
Table 2.1. For each v € C;, create D by increasing the number of additional ones in 


v\ by one. Then 


Let M be the set of all medians for D. From Definitions 1.10 and 2.3 and for each 


clause c; in I’, set 
M := M(D) M' := M'(D) 
M,, = M,,(P) r= M,(D) 
According to Remark 2.4, all medians ys must have ple;] = 0 for all 7 € [¢]. In A, 
B, and C, the strings come in pairs where one is complementary to the other on the 


first 2n coordinates. By Fact 2.9, each of the x; and y; coordinates are ambiguous. 


Therefore 


Maire tor, 
Mi = 101,101" « {0}. 
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Define 
H(u,A) = [J f(A (u,¢)). 
acA 
Similarly define H(y,B;) and H(p,C;). Set 


H(u) = H(u, A) : II H(u, B;) : I H(y, 4): 


iE[n] E(k] 

For each yp € M, we obtain a lower bound for H(j) and for each wp € M‘, we 
describe an exact value for H(j1). Divide M into 4 classes using the following three 
properties which a median pp € M may have. 

Property 1. Viejn\(ul@i] + ulyi]) = 7. 
Property 2. we M’. 
Property 8. we M'. 

Notice that these properties are nested. Any median pp € M with Property 2, 

must also have Property 1. Further, any ~ © M with Property 3 must also have 


Property 2. The following claims provide lower bounds for medians according to 


their properties. 
Claim 3.6. If 1. € M has Property 1, then 
H(ss, A) =[P(m))>*Ef(m + 1)P [f(r + 2)]°% 
[Fm + 3)**[F(n + 4)P [f(r + 5))* (3.2) 
Otherwise, 
H(u, A) D[f(n— DF (n + ELF) Fn + QE [F(n + 1) f(n + 3)]* 


[fn +2)f(n +4)" f(n + 3) f(r +5)" [Fn +4)f(n +6)" (3.3) 


=! Qbad: 


AT 


Proof. If 4 € M has Property 1, then H(p,a%) = H(p,a*) = n because 
atx] = aly] = 1 for all i € [n] while y only has n ones in the first 2n 


entries. Because ju[e;] = 0 for all 7 € [t], by Definition 2.4, 
H(u,a%) = H(p,a) =n+a. 


Recalling the exact strings that appear in A, we quickly obtain (3.2). 

If ys does not have Property 1, then either jz has more than n ones in the first 2n 
entries, implying H(y,a°)) > n, or p has less than n ones in the first 2n entries, 
implying H(p,a) > n. By Fact 2.7, H(u,a¢) + A(p,a) = 2n. By Fact 3.4 


and the above observations, 


f(A (u,e%))- f(A, a) > fr—-DF(n+)), 


- (H (ua) -f (A (u, cx") | > f(n-—l+a)f(n+1+a). 
Recalling the exact strings in B, we obtain the lower bound in (3.3). : 
Claim 3.7. For each uw € M and each i € [nl], 
H(n, Bi) =[f(r + DP LF (n + 2)" [f(m + 8)" [f(r + AP* =: Bgooa- (3.4) 
If has Property 2, then for every i € [nl, 
H(p, Bi) = PGovils 


If satisfies Property 1, but not Property 2, then there exists ig € [n| such that 


H(u, Bio) =[f(n — I f(r + 3) (f(r) F(n + 4)]™ 
[f(r + Uf (n+ 5)" UF (n + 2) f(n + 6] (3.5) 


= Brad: 
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Proof. For any pp € M, by Fact 2.7, 
H(u, 6%) + H(u, BO) 
By Fact 3.4, for each a € Z2°, 
fH (u,8%)) -f (4 (uBO)) = Lemp, and 
f(H(u.8%))-£(H(H,8°)) > eta). 


Therefore, for any pp € M, 
H(t, B;) = Booed: 
If uw € M’, then for each i € [n], u[x;] A uly;|. On the other hand, for each 7 € [nl, 
7 € [nl], pra) ies = Be [y;]. Therefore for any 7,7 € [n], 
H((u{e4), lus), (Ge bes], (ys) = 2. 
The same holds if 6{*” is replaced with ae Therefore, 
H (y, AS) =H (w,80) =n, 


Dh Be (x. By”) =n+a. 


As a result H(w) = Bgooa: 
If yw satisfies Property 1 but not Property 2, then we can define a tighter lower 
bound on H(i, B;). In particular, because p ¢ M’, there exists i9 € [n] such that 
(+2) 


plvio] = Ulyigl- Recall B+ [xi,] = BF [yig] = 1 and BS (xi) = BS [yig] = 0. 


Therefore, 


L[zig] =1 =A ((u Lig}, HYio Js (a ig a Yio )) = 0, 


a) ( 
A((u Lig}, HYio 5 (85, Vig arn Vio )) = 2, and 


a) ( 
Lig] =0 => A((u Lig}, HYio J (Bi, Tig mera Vio )) = 0, 


H((ul2io], UlYiol), (BS [zie], Bat viel) = 2- 
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Because pu satisfies Property 1, there are exactly n ones among the first 2n coor- 


dinates. Without loss of generality, u|x;,| = ulyi,] = 1. Set 


S = {2j,y; 29 € [n],7 A to}. 


(+a) 


Then ps has n — 2 ones and n zeros among the coordinates in S. However, §;, 


takes the value 0 on each of the coordinates of S and ae” takes the value 1 on the 


coordinates of S. Therefore, 


[rio] = =HA(, pa) =0+ (n i 2), 


to 
A(p, a) =2+ n, and 


pli] = 0 +H (u, Bo) =0 + (n—2), 


20 


H(p, Br) =2+n. 


to 
As a result, 


H(p, 6) H(u, By”) = (n—2)(n +2), 


H(u, 85°) H(u, BO”) = (n-2+a)(n+2 +a). 


Taking into account all binary strings in B;,, we conclude H(i, B;,) = Braqin (3.5). 


Fact 3.8. For the quantities defined in Claim 3.7, Bgooa < Baa. Consequently, if 


ue M\M' and satisfies Property 1, then Tlie H(u, Bi) > BoadBn soy. If uw € M\M! 


goo 


and does not satisfy Property 1, then Ticfny H(H, Bi) > Brood: 


Proof. Observe 
iat =F (n)Pf(n $14 F(n +4) f(n + 5) F(n + =| 
Daeeu f(n t 2) f(n 7 3)l 


fm I f(n + ay. | f(n)f(n +5) ly fe +1) f(n+ ali 
f(n+2)f(n+3)| (| f(n+2)f(n+3)] [f(r +2)f(n +3) 


>1 


where the last inequality follows from Lemma 3.3. a 
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Claim 3.9. For any  € M and for each j € |kl, 
H(u,C;) > [f(n + 2)]?[F(n + 3)? = Ymin- 
If we Mp, then for each j € [kl], 


H(u, Ci) =m)" + DIF (n + 2))7TF (n+ 3))P LF (n + APTA ( + 5))" = Ygoo: 


(3.6) 
Tf we M'\ Mt, then there exists ig € [k] such that 
H(u,C;,) =f(n — YF (n)PUF(n + DPF (n + 2)” 
[fr + 3))°[F(n + PTF (m + 5) F(m + 6) (3.7) 


=: Ybad- 


Proof. Let be an arbitrary median in M. By Remark 2.12, the binary strings 
in C; come in pairs that are complementary on the first 2n entries. With a careful 
examination of Table 2.1, if 7,7’ € C; are complementary on the first 2n coordinates, 
then e(7) + e(n’') = 3 where e is the function specifying the number of additional ones. 
By the definition of C/, the strings still come in complementary pairs, (7, 77’), but here 
e(#) + e(f’) = 5 because the number of additional ones in # and 7 is precisely one 


more than the number in 7 and 7’. By Fact 2.7, for each of the 25 pairs in Ci, 
A(u,h) + H(p, 7) = 2n +5. 


Then by Fact 3.4, 


f(A(e M)F(A(u, 9) 2 f(r + 2) f(n + 3) 


which gives the general bound jn. 


51 


Now suppose p € Mp. This implies p € M(, for all clauses ¢; in I’. By the 
definition of Cj, for each 05 € Ci, H(u,0;) = H(u,v;) + 1 where vj € C;. From (2.2), 


We See 
{H(u, 05) : 7 € [50]} = {nz, (n + 1)@), (n + 2) ayy, (N+ 3) a2); (n+ 4)@); (n+ 5) pH. 


This immediately implies H(w,C)) = Ygooa in (3.6). 

Finally, suppose uw € M’\ M‘.. Using the bijection in Definition 2.1, 4 must 
correspond to a truth assignment which does not satisfy I’. So there is a clause c;, 
in I’ which is not satisfied. Therefore p € M'\ Mj, . From (2.1), adding 1 to each 


A(p,v;°) to obtain H(y,0;°), we obtain 


{H(p, vi) > 7 € [50]} = {(n — 1), n@), (2 + 1)(ay, (n + 2) 15), 


(n + 3) 15), (nm + 4)(3), (n + 5)(6), (n + 6) (ay fe (3.8) 
This directly implies H(u,C;,) = Youd in (3.7). r] 


Fact 3.10. For the quantities defined in Claim 3.9, Ygooa < Yoad- AS a result, when 
wEeM'\ Mr, 


H(u,C) > Yaa Yroou: 


Proof. Indeed, this was our initial assumption: 


nd _ fn =F M+ DPF + 3)PF(n+6) | 


Yooor = F(M[F(rn + DPF (mn + 4)PLF(m + 5)] 
The bound for H(yu,C) results from the fact that  € M’ either corresponds to a 


satisfying truth assignment for c; or a non-satisfying truth assignment for each clause 


Cj. | 


In summary, Claims 3.6, 3.7, and 3.9 along with Facts 3.8 and 3.10, we give the 


following bounds. If uw € M‘, 


Hy) Fi good trait Vivod =: hg. 
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If we M'\ Mi, 
H(t) > Algood? rod’Ybad Vyood =: he. 
If w € M \ M' and has Property 1, 
H(t) > AgoodBraaS rod Ymin =! M1. 
If « € M but does not have Property 1, 
H(u) = Seale sila ee 


In order to complete, the proof, we only need to show hg < h; for 7 € {0,1, 2}. 


By one of our assumptions about f(x), we have already verified in Fact 3.10 that 


Next observe 


k 
hy = Boaa Ymin 
~ : k—-1 
ho Bgood Yoad’Y good 
k 
> Boad _ Yimin 


k 
ie) good Yoad 


f(n—1)f(n +3) ofa) fe Lf(n+ a f(n +2)f(n+6)]” 
[f(n + D/P? [f(n + 2))? [f(n + 3)]? [f(n + 4)? 


[f(n +2) [fn +3)] | 
f(n— DEF) LF (n + DEF (n + APL (n + 5)]8f(n + 6) 


| 
+f 


n+1 
n+2 
>1 


where the last inequality follows from Lemma 3.3. 
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Finally we prove that ho > hg. 


is 
ho — Qbad Ymin 


k-1 
he OQ good YoadY good 


k 
Qbad YVmin 


> k 
Qa good Yboad 


_ a a ee [foe aie 9)" 
[f(n + 1)? [f(n + 2))? 


| teense) [ie ee ‘ 
f 


) 
[f(r + 3))? [f(n + 4))? [f(n + 5)) 


| | [F(n + 2) f (n+ 3))° 
F(n— DUFF 


Hite apa TSO TO 


f(n + 4))*[F (r+ 5))*[f(m + 6)]* 
PLPC + VPS F(n + 5)]?% 


, [F(n + 2)]*[f(n + 3)]1* 
[f(r — 1)[F (rn) |*[F(m + 1) lf (nm + 4) [fF (n + 5)/* Fl + 6)* 


=1. 


Therefore for any fj € M’, and wp € M\ Mj, then H(i) < H(yu). Thus, if we 
could determine, in polynomial time, how many medians pp € M have h(i) < hg, then 
we could determine how many satisfying truth assignments exist for I’ in polynomial 


time. = 


Corollary 3.11. Fir a function f(x) : Z=° — R#° which satisfies the following 


properties: 
e log f(x) is strictly concave up, 
e the function values of f can be computed in polynomial time, and 


e for all but finitely many n € Z, n > 2, 


f(n = 2)[F(n + IPF (n+ 2)P fm + 5) 
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For arbitrary m,s € Z*° and D € R, let S := {4,1,...,Um} be a multiset of 


binary strings, each of length s and let M be the set of medians for S. Then it is 


NP-complete to determine if 


min [] f (#(%,n)) < D. (3.9) 


This next theorem gives the same result as Theorem 3.5 with one change in the 


conditions on f. While Theorem 3.5 required that 


f(n— 2)[F(n + IPF (n+ 2)P fm + 5) 


fm—-Di@PE@+3efn+s 


Theorem 3.12 switches the inequality to consider functions in which the ratio is less 


than 1. 


Theorem 3.12. Fir a function f(x) : Z=° — R#° which satisfies the following 


properties: 
e log f(x) is strictly concave up, 
e the function values of f can be computed in polynomial time, and 


e for all but finitely many n € Z, n > 2, 


f(n = 2)[f(n+ DPF + DP f(a + 5) 
f(n—DIFM)PF(n + 3) PR f(n + 4) 


<i 


For arbitrary m,s € N and D ER, let S := {1%,%,...,Um} be a multiset of binary 


strings, each of length s. Then it is #P-complete to determine how many medians [u 


for S have 
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Proof. This proof closely mirrors the proof of Theorem 3.5. Here we will note the 
changes that need to be made. 
This time, we define 98k + 24kn binary strings, each of length 2n + 245k + 60kn 
with coordinates 
(op; Yin <2 las Uns Cian expe) 
Let at® and a(t be defined as before. The collection A will now consist of the 


following 72k strings: 
e 4k copies each of a) and a), 


e 14k copies each of a?) and a), 


e 14k copies each of at?) and a), 


e 4k copies each of at” and a4), 


Define B®) and on as before. The collection 6; now consists of the following 


24k strings: 


e 6k copies each of Bi?) and ar ; 


e 6k copies each of ge) and a. 


Following the explanation found in Section 2.1.1, Table 3.1 defines 26 binary 
strings C; for a clause. As in the proof of Theorem 3.5, we will add 1 additional one 
to each of the 26 strings in C; to create C/. 

Using the same Properties 1, 2, and 3 as before, we obtain the following values 


which are analogous to the bounds in Claims 3.6, 3.7, and 3.9: 


Agood =[f (n + 1)]§*[F(n + 2)]*[f (nm + 3)]?*[f (n + DI, 
Mad =[f(n) f(n + 2)" [f(r + Lf(n +3) 


[f(nt+ 2 fn +4] [f(r + 3) f(n +5), 
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Boooa =[f(n + 2)]*[f(n + 3)]™, 
Brin =[f(n + 2)]**[f(n + 3)]*, 
Baa =[f(n) f(r + 4)" [F(n + f(r + 5)", 
Yoood =F (n — 1[F(n)P LF (mr + YP LF (m + 2))° 
[f(r + 3)Pf(n + PF (n + 5)Pf(n + 6), 
Yad =[F(n)}*[f (rn + DI] (n + 2)P [Fm + 8))°[F (n+ 4) LF (nm + 5), 


‘min =[f(n + 2)])*[f(n + 3)]*. 


By our assumption about f(z), 


Yeod _ fF (nFrn+ HP F+4aPfnt+s) oy 


Yoooa f(r — 1[F(n + 2)/°[ f(r + 3) f(r + 6) 


which implies Ypaa > Ygood- 
Next we determine the values of ho, h1, hz, h3 in this setting. As before, if w has 
Property 7, but not property i+ 1, then H(u) > h;. Further, if 4 has Property 3, 


H() = h3. If w does not have Property 1, then H(j1) > ho. 


eee n k 
hg = OlgadP pood Veobd 
fad n k-1 
ho — OgoodP good Yoad Y good 2 
faa n-1 k 
hy — Open bad aire Ymin- 


— n k 
ho — ade date * 


As in the proof of Theorem 3.5, we will show h3 < ho, h4, ha. 


By our assumption about f(x), 


hy _ Yoad 


—_— = ls 
hg Ygood 
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Also 


hy = Bead ? Nei 
he Based Yoad Vood 
Boat Ymin 
pea WG 
= | [Aine d) [fees ore +5) 
[f(n + 2)? [f(n + 3)? 


k 


. | [f(n + 2)}°LF(n + 3)]?° | 
[f(n)ELf(n + DEF (n + D]8[f(n + 5/4 


-| (n) f(n +5) |" 
f(n+2)f(n4+ 3) 


>1 by Lemma 3.3. 


Finally we show hp > ho. 


he 
ho i Qbad Ymin 


k-1 
hg good YoadY good 


k 
= Qbad . Ymin 


k 
Agood Ybad 


7 ey fe nite)" 


LL [f+ iP [f(n + 2)? 


f(n +2)f vif] ee 
fin+3)2 [Fn + +P 


a 
la f(n +2)]"[f(n + 3)]° 
( 


Nal ert [Ff (n + 4)]° ary 


_ Lf) Ifa + DI Ef (m+ 2) * LF (nm + 3) "1 F (n+ 4 Lf (n + 5)]™ 
[f(r + DF (nm + 2)P°*[F(n + 3) [f(r + 4)]°* 


| [F(a + 2)]!°*[ F(a + 3)]!0* 
(Fn) "Efe + DIF (n+ 4) (Flr + 5) 


= 


Making each of these changes in the proof of Theorem 3.5, we complete the proof of 


Theorem 3.12. = 
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Table 3.1 The 26 strings to complement the collection in Table 2.1 along with 
their Hamming distance from medians in M’. 


A B Cc M1 M2 M3 M4 M5 M6 M7 M8 
Values of vilael, 101010 | 101001 | 100110 | 011010 | 100101 | 011001 | 010110 | 010101 
Row | vj on its vilye] | Add’l 

# | support set | (ve Z c;) | Ones 

1 010000 0 0 n+1 n n n-1 n+1 n-1 n n—-1 
2 0001 00 0 0 n+1 n n n+1 n-1 n+1 n n-1 
3 00.000 0 0 n+1 n n n+1 n—-1 n—-1 n n—-1 
4 10111 1 3 n+2 n+2 n+2 n+4 n+2 n+4 n+4 n+4 
5 11101 1 3 n+2 n+2 n+4 n+2 n+4 n+2 n+4 n+4 
6 111110 1 3 n+2 n+4 n+2 n+2 n+4 n+4 n+2 n+4 
7 010100 0 2 n+4 n+4 n+2 n+2 n+2 n+2 n n 

8 01000 0 2 n+4 n+2 n+4 n+2 n+2 n n+2 n 

9 00010 0 2 n+4 n+2 n+2 n+4 n n+2 n+2 n 

0 10101 af n—-1 n n n+1 n+1 n+1 n+3 n+3 

1 10111 1 n—-1 n n n+1 n+1 n+3 n+1 n+3 

2 11101 1 n—-1 n n n—1 n+3 n+1 n+1 n+3 

3 10010 0 n+2 n n n+A4 n—2 n+2 n+2 iD 

4 01100 0 n+2 n n+4 n n+2 n—-2 n+2 n 

5 01011 0 n+2 n+4 n n n+2 n+2 n—2 n 

6 10100 0 + n n—2 n+2 n+2 n n n+4 n+2 

7 10011 0 + n n+2 n—2 n+2 n n+4 n n+2 

8 01101 0 + n n+2 n+2 n—2 n+4 n n n+2 

9 10101 0 + n—-2 n n n n+2 n+2 n+2 n+4 
20 01010 1 2 n+5 n+3 n+3 n+3 n n n n—-1 
21, 10010 1 2 n+3 n+1 n+1 n+5 n n+3 n+3 n+1 
22 01100 1 2 n+3 n+1 n+5 n+1 n+3 n n+3 n+1 
23 01011 1 2 n+3 n+5 n+1 n+1 n+3 n+3 n n+1 
24 10100 1 2 n+1 n-1 n+3 n+3 n n n+5 n+3 
25 10011 1 2 n+1 n+3 n-1 n+3 n n+5 n n+3 
26 01101 1 2 n+1 n+3 n+3 n—-1 n+5 n n n+3 


The information in this table is to be read in the same way as the information 
in Table 2.1. This is detailed in Section 2.1.1. 


Corollary 3.13. Fir a function f(x) : Z=° — R° which satisfies the following 


properties: 
e log f(x) is strictly concave up, 
e the function values of f can be computed in polynomial time, and 


e for all but finitely many n € Z, n > 2, 


f(n=2)[f(n+ DPF (n+ 2)Pf(n +5) 
f(n—DFM)PF(n + 38) PF f(n + 4) 


For arbitrary m,s € Z?° and D € R, let S := {14,12,...,U¥m} be a multiset of 


binary strings, each of length s and let M be the set of medians for S. Then it is 
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NP-complete to determine if 


min [] f(H(vi,4)) < D. (3.10) 


iE[m] 


We can also state the following corollaries for functions which are strictly concave 


down. 


Corollary 3.14. Fir a function f(x) : Z=° — R#° which satisfies the following 


properties: 
e log f(x) is strictly concave down, 
e the function values can be computed in polynomial time, and 


e for all but finitely many n € Z, n > 2, 


f(n = 2)[F(n + IPF (n + 2)P fm + 5) 


fin —Dif()Bf(n +3) 2 f(n +4) Al. 


For arbitrary m,s € Z*° and DER, let S := {1%4,%,..., Um} be a multiset of binary 


strings, each of length s. Then it is #P-complete to determine if how many medians 
L for S have 


I] f(4™,u)) = D. (3.11) 


tE[m] 


Proof. If the function f(a) has the property that log f(x) is strictly concave down, 
then log Fa) is strictly concave up. Therefore by Theorems 3.5 and 3.12 for the 


function GE it is #P-hard to determine the number of medians jz which satisfy 


Tie{m] K Te? a < S: This is equivalent to asking for the number of medians jz have 
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Corollary 3.15. Fir a function f(x) : Z=° — R#° which satisfies the following 


properties: 
e log f(x) is strictly concave down, 
e the function values of f can be computed in polynomial time, and 


e for all but finitely many n € Z, n > 2, 


f(n—2)[fln+ DPF (n+ 2) f(n +5) 
f(n—DFM)PF(n + 3) PF f(n + 4) 


#1 


For arbitrary m,s € Z7° and DER, let S := {14,12,...,U%n} be a multiset of binary 


strings, each of length s where M is the set of medians for S. Then it is NP-complete 


to determine if 


min |] f(A (%,")) > D. (3-12) 


3.2. STOCHASTIC APPROXIMATIONS 


We have seen several proofs showing that it is hard to calculate many of these quan- 
tities. However, we may further ask if any of the quantities can be approximated. 


One method of approximation is by an FPRAS: 


Definition 3.16. A counting problem #A in #P has an FPRAS (fully polynomial 
randomized approximation scheme) if there is a randomized algorithm such that, for 
any instance of #A and any €,6 > 0, the algorithm outputs an approximation f for 


the solution f satisfying 


f x 
p(-f<f<sate) 21-5 


1 


and the algorithm runs in time polynomial in the size of the instance, ~, and log ;- 


Before stating our results, we define a couple more complexity classes for decision 


problems: 
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Definition 3.17 (Gill (1977)). A decision problem, A, is in the class RP (randomized 
polynomial time) if there is a probabilistic Turing machine that runs in polynomial 
time in the size of the input, returns “true” with probability at least 5 when the answer 


for A is true, and returns “false” with probability 1 when the answer for A is false. 


Definition 3.18 (Gill (1977)). A decision problem, A, is in the class BPP (bounded- 
error probabilistic polynomial time) if there is a probabilistic Turing machine that 
runs in polynomial time in the size of the input, returns “true” with probability at 
least E when the answer for A is true, and returns “false” with probability 2 when the 


answer for A is false. 
One result connecting these classes is the following: 


Theorem 3.19 (Papadimitriou (1994)). If the intersection of NP and BPP is non- 
empty, then RP=NP. 


Note that each result below holds for functions f(a) with log f(x) strictly concave 
down. The analogous results for the functions whose logarithm is concave up are still 


open. Now for our first result regarding sampling of medians for #StarSPSCJ(f). 


Theorem 3.20. Fir a function f(x) : Z=° — R#° which satisfies the following 


properties: 
e log f(x) is strictly concave down, 
e the function values of f can be computed in polynomial time, and 


e there exists € > 0 such that for all but finitely many n € Z, n > 2, 


f(n = 2)[F(n + DP Fn + 2)" Fn + 5) 
f(n— DF M)PF (n+ 3)P f(r + 4) 


For arbitrary m,s € Z7° and D ER, let S := {14,2,...,U%n} be a multiset of binary 


strings, each of length s. If there is a rapidly mixing Markov chain with stationary 


distribution proportional to [let f (H(%, m)), then RP=NP. 
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Proof. Fix a function f as described in the theorem. Because log f(x) is strictly 


concave down, log(f(a))~' is strictly concave up. Set g(x) := (f(x))7. 

Now recall the proof of Theorem 3.5 for strictly concave up functions. Take a 
D3CNF I with n variables and create a multiset of binary strings, D. The set of 
medians for D is M = {0,1}°" x {0}. There is a one-to-one correspondence between 
the medians in the subset M’ = {01,10}" x {0}! and the truth assignments for I’. 
Those medians which correspond to satisfying truth assignments for I’ form the set 
M',. The multiset D is constructed so that each wp € M', has 
I] FiGqn = II 9 g( = 0 ele odo =: hg 


nED nED 


and all other medians have 


1 
ee H > Be n e k-1 = h 
TL Ferg ay = LL 9m) > esvedBroear tn oood = he 


Equivalently, if w € M‘, then 


Il fn.) = a 


nED 
Otherwise, 
i 
IT fA, 4) < 7. 
neED 2 
Further, 
he = Ybad 
hg Ygood 


g(r — ign + 2))*lg(n + 3)? 9(n + 6) 
g(rjig(n + DP lg(n + 4)Plg(n + 5) 
_ fMpffr+ DPF a+ HP @ + 5)] 


f(n—A)[F(r + 2)P [f(r + 3)P f(r + 6) 
1 


> 
l-e 


where the last inequality is a result of the assumption in the theorem statement. As 


a result 


Now select an integer r, dependent only on the values of n and ¢, such that 


(4) > 22"? Create a new multiset D(r) of binary strings such that 


D(r) =DwW...wD. 


r times 
The set of medians for D(r) is the same as the set of medians for D. However this 


time, for a median p € M‘, 
Otherwise, if u € M \ M‘, 
By the choice of r, 


Tene Ne 1 1 us Ly" 
<Q) <Q 
(=) ho = ho l—-e hg 


Since M = {0,1}?" x {O}*, |M| = 2?” and the above inequality shows that for each 


Lo € Mp, 
I] f(4#@,40))> SS [[ f(4™, 4). 


nED LEeM\M', nED 


Further, 


II f#(.H0) > 5 DO TL fH.) 


nED LEM nED 


Now suppose that we had a rapidly mixing Markov chain for this instance of 
#StarSPSCJ(f), as stated in the theorem. From the calculations above, it must 
sample medians which correspond to satisfying truth assignments for I’ with prob- 
ability at least 5. This is precisely an RP for D3SAT. However, this immediately 
implies RP=NP because D3SAT is NP-complete. : 


The following theorem gives the same result as the last one for different functions 


f. In particular, it switches the inequality that f is required to satisfy. 
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Theorem 3.21. Fir a function f(x) : Z=° — R#° which satisfies the following 


properties: 
e log f(x) is strictly concave down, 
e the function values of f can be computed in polynomial time, and 


e there exists € > 0 such that for all but finitely many n € Z, n > 2, 


fn = Difln + VIF + MEF +5) 
F(a — DIFP +3 Fn +4) 


For arbitrary m,s € Z7° and DER, let S := {4,1%,...,U¥m} be a multiset of binary 


strings, each of length s. If there is a rapidly mixing Markov chain with distribution 


proportional to lietm f (H(M%, m)), then RP=NP. 


Proof. The proof for this theorem follows the same line of reasoning as the proof 
for Theorem 3.20. However, it makes use of details in Theorem 3.12 rather than 


Theorem 3.5. a 


When f is a function with log f(x) is concave down, we examine the possibility 


of an FPRAS (Definition 3.16) for #StarSPSCJ(f) . 


Theorem 3.22. Fiz a function f(x) : Z*° — R2° for which: 


e log f(x) is strictly concave down, 
e the function values of f can be computed in polynomial time, and 


e there exists € > 0 such that for all but finitely many n € Z, n > 2, 


f(n = 2)[f(n+ DPF (n+ 2) f(n +5) 
f(n— DF M)PF(n + 3) PR f(r + 4) 


<l-e. 


For arbitrary m,s € Z*° and DER, let S := {1%4,%,...,U¥m} be a multiset of binary 


strings, each of length s. If there is an FPRAS for calculating 


>» IT fA), 


LEM ieE[m] 
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then RP=NP. 


Proof. Let r be an integer so that (4) > 2?n+2. Tn the proof of Theorem 3.20, we 
created a new multiset of genomes D(r). The set of medians M for D(r) is precisely 
{0,1}2" x {0}* and each median yp € M’, which corresponds to a satisfying truth 


assignment for J” has 


All other medians have 


I s.m)<(-). 


nED(r) 


Therefore, if /” has no satisfying assignments, 
TL if : 
S I set) <2 (-) 
BEM n€D(r) 2 
If there is a satisfying assignment for I", then 
1 r 
IL f#G.) = (=) - 
LEM nED(r) 3 


By the choice of r, we have the following inequality to relate the two quantities: 


Ly hoy 
= \ g2nt2 (—) 
(.,) hg 
Now suppose that there is an FPRAS for T := Vyem Mnencr) f(A (0, w)). In other 


words, for any €,6 > 0, there is a randomized algorithm as described in Definition 3.16 


which outputs a quantity T such that 


P(— <P<T(l+0) 21-6 


1 


3 and ¢ =1. Therefore, 


Consider the case when 6 = 


p(or<f<or)>5. 
D) 3 


Therefore, if I’ can be satisfied, then T’ > (4) and the probability that T is at least 


$T = 5 (4) a eae (4) is . On the other hand, if I’ cannot be satisfied, then 
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fe (4) and the probability that T is at most 2T = 22”+! (4) is 3. Therefore, 
we have a BPP algorithm (Definition 3.18) for 3SAT. 


Because 3SAT is NP-complete, Papadimitriou’s Theorem 3.19 implies RP=NP. ss 


A similar result holds for functions f(a) which satisfy the opposite inequality. We 


do not give a proof as it follows the reasoning in the proof of Theorem 3.22. 


Theorem 3.23. Fix a function f(x) : Z=° + R=° for which: 


e log f(x) is strictly concave down, 
e the function values of f can be computed in polynomial time, and 


e there exists € > 0 such that for all but finitely many n € Z, n > 2, 


f(n = 2)[f(n+ DPF (a + 2)P f(r +5) 
f(n—DIFM)P Fn + 3) PR f(n + 4) 


>1+e 


For arbitrary m,s € Z7° and D ER, let S := {4,1%,...,U¥m} be a multiset of binary 


strings, each of length s. If there is an FPRAS for calculating 


> Il fA), 


LEM iE[m] 


then RP=NP. 
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CHAPTER 4 


RESULT FOR BINARY PHYLOGENETIC ‘TREES 


In this chapter, we return our attention to counting most parsimonious SCJ scenarios, 


but this time our phylogenetic trees are binary trees. We use the following definition. 


Definition 4.1. A tree is a binary tree if it is rooted and every non-leaf vertex has 


exactly two children. 
Recall the statement of #BinSPSCJ: 


Definition 1.25. Given arbitrary integer m > 2, let T be a binary tree with m leaves. 
Let B = {v;}@, be an arbitrary multiset of binary strings and a surjective function 
yp: L(T) > B. Define F to be the set of most parsimonious labelings yp! which extend 


yp to V(T). Determine the value of 


~ TL £6), ¢))!: 


y’/€F we E(T) 


The main result of Section 4.2 is the theorem which states #BinSPSCJ is in #P- 
complete. In the first section, we develop several tools and algorithms which lay the 


foundation for our main theorem. 


4.1 ALGORITHMS FOR FINDING MOST PARSIMONIOUS LABELINGS 


Let F = cy Aco A... Ace be a D3CNF with variables {v1,v2,...,Un}. Select new 
variables {w1,W2,...,Wn} which do not occur in I’. For each i € [n], understanding 


subscript n + 1 as 1, define the following D3CNF, 


@; = (vu; V Wi V Vi41) /\ (vu; V Wj V Visi) /\ (Uj V Wi V Vi41) /\ (Uj V Wi V Viti): (4.1) 
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Observe that &; is equivalent to the “exclusive or” (vu; V w;) A (Uj V W;). Define 


Necessarily, if I is a D3CNF then so is W(I’). 


Lemma 4.2. For I’, an arbitrary D3CNF, it is #P-complete to determine the number 


of satisfying truth assignments for (I). 


Proof. We have already shown in Lemma 1.20 that #D3SAT is in #P-complete. So 
to prove this result, we will show that the satisfying truth assignments for I” and for 
W(I°) are in one-to-one correspondence. 

Any truth assignment which satisfies Y(I’), when restricted to {v1, vo,...,Un} will 
necessarily satisfy I’. 

For the other direction, recall that ®; is equivalent to the “exclusive or” for v; 
and w;. Therefore, given a satisfying truth assignment for I", we can create a unique 
satisfying truth assignment for W(I") by assigning to each w; the opposite value of 


Uj. | 


Next we provide two different algorithms for creating most parsimonious labelings 
given a rooted binary tree and a leaf-labeling y : L(T) — {0,1}*. If we restrict y(¢) 
to a single coordinate c for every leaf 2, we obtain a labeling y, : L(T) > {0,1}. Each 
algorithm presented below will consider leaf labels from the set {0,1} and output a 
most parsimonious labeling y!, : V(T) — {0,1}. Obtaining a most parsimonious 
labeling for each coordinate in this way, we combine these labelings to create a most 
parsimonious labeling y’ : V(T) > {0, 1}* for T and the original leaf-labeling y. 

Let T be a binary tree with root p and let y : L(T) > {0,1} be a labeling 
for the leaves. Let y’ : V(T) — {0,1} be a most parsimonious labeling (Def- 
inition 1.4) which extends wy. Because each vertex is labeled with a single bit, 


A(y'(u), y’(v)) € {0,1}. By definition, the most parsimonious labeling y’ mini- 
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mizes the sum D,ex(r) 1(y'(u), '(v)). Consequently, y’ must minimize the number 
of edges uv such that y’(u) 4 y'(v). 


First, we have Fitch’s algorithm to find most parsimonious labelings. 


Fitch’s Algorithm (Fitch (1971)). Let T be a binary tree with root p and leaf- 
labeling p : L(T) + {0,1}. The following algorithm, completed in two parts, will find 


a most parsimonious labeling vy! : V(T) — {0,1} which extends vp. 


Part 1: Define a function B on the vertices of T as follows: For each leaf €, set 
B(0) := {y(O)}. Extend this assignment to all vertices of T by the following 
rule: For a verter u with children v1, v2 such that B(v1) and B(v2) have been 
defined, set 


By) AN Biv2) if Bw) A Blve) 4 B, 
B(u) := (4.2) 


B(v,) U B(vg) otherwise. 


Part 2: Select a single element a € B(p). Define a function y' on the vertices of T 
as follows: Set y'(p) := a. Extend y' to V(T) by the following rule: If v is 
a child of u and y'(u) is defined, then 


(u if p'(u) C Biv), 
Oe y'(u) y'(u) (4.3) 


1-y(u) ify'(u) ¢ Biv). 
The resulting y' is a most parsimonious labeling extending yp and is called a Fitch 


solution. 


While Fitch solutions are most parsimonious labelings, there are cases when 
Fitch’s algorithm finds some of the most parsimonious labelings but not all of them. 
However, Sankoff’s algorithm, described below, will produce all most parsimonious 


labelings (Erdés and Székely 1994). 
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Sankoff’s Algorithm (Erdés and Székely (1994); Sankoff and Rousseau (1975)). 


Let T be a binary tree with root p and leaf labeling yp: L(T) — {0,1}. This algorithm 


is completed in two steps. 


Part 1: 


Part 2: 


Define functions sg and s; on the vertices of T as follows: For each leaf £, 


89(0) := ie (4.4) 


oo otherwise. 


0 #fy(é)=1, 


51(L) : 


oo = «otherwise. 
Extend these functions recursively to all vertices by the following: If vg and 


v1 are children of u and s;(v;) has been defined for all i,j € {0,1}, then 


so(u) := min{so(vo), $1(vo) + 1} + min{so(v1), s1(v1) + 1}, (4.5) 


si(u) := min{so(vo) + 1, 51(vo)} + min{so(v1) + 1, s1(v1) }. (4.6) 


Note: For anyv € V(T), s;(v) counts the minimum number of edges, within 
the subtree containing v and its descendants, that will witness a change if 
a most parsimonious solution labeled v with the value 1. A leaf will have 
so(l) = c0 (or s1(£) = co tf it is impossible for a most parsimonious labeling 
to label £ with a 0 (1), because most parsimonious labelings must agree with 


the original leaf label. 


For eachv € V(T), select a, € {0,1}. Define the function y' on the vertices 


of T as follows: For root p, define 


Extend vy’ to V(T) by the following rule: If v is a child of u and y'(u) is 
defined, then define y'(v) as follows: If y'(u) = 0, then 


0 if so(v) < si(v) +1, 
9'(¥) = Yay if so(v) = 5;(v) +1, (4.7) 
1 éf so(v) > si(v) +1. 
If y'(u) =1, then 
1 if si(v) < so(v) +1, 
9'(¥) = Yay if :(v) = so(v) +1, (4.8) 


0 if si(v) > so(v) +1. 


The resulting y' is a most parsimonious labeling for T extending y and is called a 
Y~ 


Sankoff solution. 


The following lemma draws a connection between the solutions found from each 


algorithm. 


Lemma 4.3. Let T be a binary tree with leaf-labeling p : L(T) > {0,1}. Suppose 
that, for each u,v € V(T) with v a child of u, the function B in Fitch’s algorithm 


satisfies 
Bly) =40.1p + Bu) = {05 14. (4.9) 


Then for T and y, all Sankoff solutions are Fitch solutions. In other words, Fitch’s 


algorithm finds all most parsimonious labelings. 


In order to prove Lemma 4.3, we first establish a series of claims (4.4 through 4.7) 


under the assumptions of Lemma 4.3. 


Claim 4.4. For any non-leaf verter v, if B(v) = {x} for some x € {0,1}, in Sankoff’s 


algorithm so(v) =0 and s\(v) = 2. 
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Proof. The proof proceeds by reverse induction on the distance from the root. For 
the base case, we consider those vertices whose children are both leaves. Let v be such 
a vertex with children vg and v,. By symmetry of the argument, assume B(v) = {0}. 
Then B(ve) = B(v,) = {0} which only happens for leaves if y(ve) = y(v,) = 0. 
By (4.4), so(vce) = so(v,-) = 0 and s1(ve) = si(v,) = oo. As desired, (4.5) implies 
So(v) = 0 and (4.6) implies s;(v) = 2. 

For the inductive hypothesis, assume that each vertex v of distance at least d > 1 
from the root has either so(v) = 0 and s;(v) = 2 or so(v) = 2 and s;(v) = 0. Let u 
be a vertex of distance d — 1 from the root. Again, we assume B(u) = {0} as the 
argument for the case when B(u) = {1} is very similar. This vertex has two children, 


ue and u,. There are three cases to consider. 


(1) If we and u, are leaves, then the argument in the base case gives sg(u) = 0 and 


8,(u) = 2 as desired. 


(2) If ue is a leaf and u, is not a leaf, then so(u,) = 0 and s;(u,) = oo and, by 
the inductive hypothesis sg(ue) = 0 and s;(ue) = 2. Therefore (4.5) implies 


so(u) = 0 and (4.6) implies sy(w) = 2. 


(3) If ue and u, are not leaves, then by the inductive hypothesis, so(ue) = so(u,) = 0 
and s1(ue) = si(u,) = 2. Again, (4.5) implies so(w) = 0 and (4.6) implies 


Sit) = 2. 
This complete the proof of the claim. : 


Claim 4.5. For any vertex v with B(v) = {0,1} from Fitch’s algorithm, we will have 


so(v) = s1(v) in Sankoff’s algorithm. 


Proof. This claim is also proven by induction on distance from the root where the 


base case examines those vertices with greatest distance from the root. 
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For the base case, let v be a vertex with B(v) = {0,1} and none of its descendants 
u have B(u) = {0,1}. For children ve and v, of vu we may assume B(ve) = {0} and 
B(vu,) = {1} by (4.2). By Claim 4.4, 
80(ve) = $1(v,) = 0 and so(v,) = 51 (ue) = 2. 
Therefore so(v) = s,(v) = 1. 
For the inductive hypothesis, suppose all vertices u with B(u) = {0, 1} of distance 


at least d > 1 from the root have so(ue) = si(u,). Let v be a vertex at distance d—1 


from the root with B(v) = {0,1}. There are three cases to consider: 


(1) If v has a child vp with B(ve) = {0}, then by (4.2) the other child v, must have 


B(v,) = {1} and we can use the argument in the base case to see so(vz) = $1(v;). 


(2) If v has a child ve with B(ue) = {1}, then by (4.2), v must have another child 


v, with B(v,) = {0}. This puts us back in case 1. 


(3) If v has a child ve with B(ve) = {0,1}, then by (4.2), v must have another 
child v, with B(v,) = {0,1}. By the inductive hypothesis, s9(vg) = s1(ve) and 
S0(U,) = 81(v,-). By (4.5) and (4.6), so(v) = s1(v). 


This completes the proof of the claim. a 


Claim 4.6. For any non-leaf vertex v with B(v) = {i} (i © {0,1}), both Fitch’s 


algorithm and Sankoff’s algorithm will define y'(v) = 1. 


Proof. In Fitch’s algorithm, this is an immediate consequence of (4.3). 

Now consider Sankoff’s algorithm. If B(v) = {0} then, by Claim 4.4, so(v) = 0 
and s;(v) = 2. Observe sg(v) < s1(v) +1 and s;(v) > so(v) + 1. Therefore y’(v) = 0 
by (4.7) and (4.8). = 


Claim 4.7. Suppose B(p) = {0,1}. For any vertex v with B(v) = {0,1}, if both 
algorithms set y'(p) := 0, then both Fitch’s algorithm and Sankoff’s algorithm will 


set y'(v) = 0. Likewise, if y'(p) = 1, then both algorithms will set y/(v) = 1. 
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Proof. For any vertex v with B(v) = {0,1}, there is a path p = uo, u1,..., Ue_1, Ut = V 
of vertices such that B(u;) = {0,1} for each 7 € [¢t]. It suffices to show that, in both 
algorithms, if y’(u;) = 0 for some 0 <i < t, then y’(uj41) = 0. 

In Part 2 of Fitch’s algorithm, if y’(u;) = 0 and B(uj4,) = {0,1}, then (4.2) 
implies y’(ujii) =0. 

In Sankoff’s algorithm, if y’(u;) = 0 and B(uji1) = {0,1}, then by Claim 4.7, 


So(Uizi1) = Si(tizi1). Thus so(uigi) < s1(uiz1) +1. Since y’(u;) = 0, (4.7) implies 


yg’ (iti) = 0. 
A similar argument can be used to show that if y’(p) = 1, then y’(v) = L. 


Therefore Fitch’s algorithm and Sankoff’s algorithm will agreed on y’(v) if they agree 


on '(p). . 


Proof of Lemma 4.3. In each algorithm, once y(p) has been set, the algorithm deter- 
ministically outputs a most parsimonious labeling of all vertices. Therefore, it suffices 
to prove that both algorithms have the same choices for labeling the root and both 
algorithms output the same most parsimonious labeling for the same choice for y’(p). 

If B(p) = {0} or B(p) = {1}, then there is only one choice in Fitch’s algorithm 
for y'(p). By Claim 4.4, Sankoff’s algorithm has the same determined value for y’(p). 
Further, all vertices v € V(T) will have either B(v) = {0} or B(v) = {1} by condition 
(4.9) and Claim 4.6 completes the proof. 

If B(p) = {0,1}, then in Fitch’s algorithm, there are two choices for y’(p). By 
Claim 4.5, s9() = si(p) in Sankoff’s algorithm, which means there are also two 
choices for y’(p). Claim 4.7 implies that if we make the same choice for the root, 
both algorithms give the same most parsimonious labeling y’. 

Sankoff’s algorithm is guaranteed to find all most parsimonious labelings and 
the most parsimonious labelings from Fitch’s algorithm coincide with those from 
Sankoff’s algorithm, this implies that Fitch’s algorithm finds all most parsimonious 


labelings. : 


75 


As mentioned earlier, these algorithms are designed for a tree T’ with leaf-labeling 
yp: L(T) > {0,1}. However, given a tree T with leaf-labeling ¢ : L(T) > {0,1}, we 
can restrict all strings to a single coordinate and run one of the above algorithms to 
find a most parsimonious labelings for V(7’) in that coordinate. Repeat this for each 
coordinate. The most parsimonious labelings found for each coordinate can then be 


combined into a most parsimonious labeling of V(T’) that extends @¢. 


4.2 COMPLEXITY RESULT FOR #BINSPSCJ 


Here is our main result on binary trees. 
Theorem 4.8. #BinSPSCJ is #P-complete. 


Proof. In Lemma 1.23 we saw that #SPSCJ is in #P. To prove #SPSCJ is in #P- 
hard, we provide a polynomial reduction from #D3SAT. 

Fix a D3CNF, I’ = Ajeqxy ci with k clauses and n variables. Let 

wWI):= Nar f & 
i€[k] i€[n] 

with 2n variables, {v1,v2,...Un, W1,W2,..-,Wn}, where each clause c; has three dis- 
tinct literals from {v;,0; : 7 € [n]}, and ®; is the D3CNF in (4.1) which guarantees 
that, for each i € [n], v; and w; have different truth values. By Lemma 4.2, I and 
W(I’) have the same satisfying truth assignments. We will construct a binary tree B 
and define a labeling y of its leaves such that the number of satisfying truth assign- 
ment for W(I") is directly computable from the number of most parsimonious SCJ 
scenarios on B. 

Each @; has 4 clauses, so W(I’) has k+4n clauses. Assign the names cg41,.-., Ck+4n 


to the 4n clauses of A j<j,) ;. Then 
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For each i € [k+4n], we define a binary tree 6; which encodes clause c;. The final 
binary tree B will join B,, Bo,..., Bean by a comb. For t = 148(16n? +8kn)(k+ 4n), 
the leaf-labeling vy : L(B) + {0,1}°""", will assign a binary string with coordinates 
(11, Y1,---52n; Yn €1;---,e¢) to each leaf. The x; coordinates will correspond to the 
vu; variables and the y; coordinates will correspond to the w; variables of W(I’). The 
e; coordinates will be for additional ones, used in a manner similar to the additional 
ones in the previous two chapters for star trees. 

In this chapter, we denote the left child of a non-leaf vertex v by vg and the 
right child by v,. The height of a vertex is its graph distance from the root. The 
construction of 6; with its leaf labeling y will come in Definition 4.12, but first we 
need some preliminary definitions. 

For any clause ¢; = Vg V vg V vy which is the disjunction of 3 distinct literals, 
Miklos, Kiss, and Tannier (2014) defined a unit subtree, U;, with 248 leaves. They 
also defined a leaf-labeling ¢ : L(U;) + {0,1}'°' where the binary strings in the range 
have coordinates (#9, %3, 24, €1, €2,...,€148). The first three coordinates correspond 
to the variables in c; and the remaining 148 coordinates are for additional ones. This 
unit subtree has some useful properties which will be discussed after Definition 4.12. 

For each i € [k + 4n], let U; be the unit subtree for clause c;. If i < k where ¢; 
relates Ug, Ug, Uy, then U; will have leaf labels with coordinates {v,%g,x 7} and 148 
coordinates for additional ones. If i > k where c; relates variables vq, Wa, Va41, U; will 
have leaf labels with coordinates {%a,Yo,%a+1} and 148 coordinates for additional 


ones. 


Definition 4.9. The tree J; in Step 5 of Definition 4.11 is a comb joining 16n?+8kn 


copies of U;, as in Figure 4.1. 


Definition 4.10. For three literals a,b,c, we define S(a, b,c) to be the complete binary 


tree of height 3 with root p with the vertices labeled with equations as follows: 


ws 


Figure 4.1. Comb connecting 
16n? + 8kn copies of U; to create J; 


1. Assign the label “a =0” to vertex pe and “a=1” to py. 

2. For each vertex u of height 1, assign the label “b= 0” to ug and “b=1” to ur. 

3. For each vertex v of height 2, assign the label “c= 0” to ve and “c=1” to v,. 
This tree is pictured on the right in Figure 4.2. We will use the representation on the 
left in place of S(a,b,c) in future figures. 


Next we construct B; which will have the same tree structure as B;. However, B; 
will have all of its vertices labeled with equations while 5; will only have leaf labels 
which are binary strings. The leaf labeling of B; will be induced by the vertex labels 


of B;. Each leaf will essentially inherit the labels of its ancestors. 


Definition 4.11. Fir i € [k + 4n]. Construct B;, a binary tree with vertex labels, as 


follows. 


A. Ifi © [k], then say clause c; has variables va,vg,vy. The construction of B; 
described below is drawn in Figure 4.2. 
a) Draw a vertex p' with two children, p’, and p%.. 


b) Label vertex pi, with the equations “x; = y; = 0” for each j € [n]\{a, 3, y}. 


Label p'. with “x; = y; =1” for all j € [n] \ {a, B, 7}. 


c) From each of pi and pi, hang a copy of S(Yo, Ye, Yy)- 
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Figure 4.2 The labeled binary tree on the right is S(a, b,c). The 
representation on the left will be used in place of S(a, b,c) in future 
figures. 


d) From each leaf of each copy of S(Ya,4¥s,Yy), hang a copy of S(%o,XB, Ly). 


e) Delete the left-most copy of S(%q,%g, 2), the one which hangs below the 
vertices with labels “yo = 0,” “yg = 0,” “yy = 0,” and with ancestor pi, 


and replace it with a copy of TJ; from Definition 4.9. 


B. Ifi € {k+1,...,k +4n}, then clause c; relates variables vg,Wea,Va4t1- The 
construction of B; described below requires only a change of variables from the 
previous construction. 

a) Draw a vertex p' with two children, p’, and p%.. 

b) Label pi with “x; = y; =0” for all gj € [n]\{a,a+1,a+2}. Label p’. with 
the system of equations “x; = y; =1” for each j € [n] \ {a,at+1,a+4 2}. 

c) From each of pi, and p'., hang a copy of S\(Ya+1; Lo+2s Yar2): 


d) Hang a copy of S(®a;Ya;%a+1) from each leaf of each and every copy of 


S(Yat1 Lat2, Yo+2): 


e) Delete the left-most copy of S(%a,Ya;ta41) and replace it with a copy of 
Ti from Definition 4.9. 


Recall 6 is a comb connecting binary trees B;. The binary tree B has a leaf 


labeling y : L(B) — {0,1}?"** where 


t = 148(16n? + 8kn)(k + 4n). 
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TF Y= i 
Vj €[n],j = Vj €[n],j = 4 


Figure 4.3. The binary tree B,, for i € [k] created for 
clause c; = ©] V Xo V £3. 


Each leaf label will have coordinates (21, 41,.--,;2n; Yn; €1,---,¢)- In the next defini- 


tion, we define 6; and values of y on the leaves of B;. 


Definition 4.12. For each i € [k+4nl], the binary tree B; will have the same tree 
structure as B;. We only need to explain the labeling y : L(B;) > HO eae 

Partition |t| into classes Ej; with |E;;| = 148 for each i € [k + 4n] and each 
j € [16n? + 8kn]. Identify the set E;; with the 7 copy of U; in T;. Here we define 
p(t) for each € € L(B;). 


There are two cases: 


e /f leaf @ is not in subtree T;, then vp(£)[e,] = 0 for all s € [t]. The value of each 
p(l)|tw] and y(2)[yw] for w € |n] is inherited from the labels of the ancestors of 


£ as they appeared in B;. 


e If leaf ¢ is in the subtree TJ; within B;, then it is a leaf within the j** copy of 
unit subtree U; for some j € [16n* + 8kn]. Recall G(é) € {0,1}! Identify 
the coordinates {é1, €2,...,é€143} with the indices in the 148 coordinates of Ej; 
in any order. If €, corresponds to coordinate e, forr € Ej, then we require 
p(L)le,] = G(2)[é,]. For each coordinate z which corresponds to a variable in c;, 


we also require that y(£)[z] = G(£)[2]. Set p(Ale.] = 0 for s ¢ Ej;. All other 
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coordinates of p(t) will take the value 0 (the value inherited from the labeling 


of their ancestors in B,. 


Define yz, : L(B) + {0,1} so that y2,(€) = y(4)[z,;]. Define yy, and ye, similarly. 
We want to examine the Fitch solutions on B for each Yz,, Yy;, and ~-,. We will first 
prove that the conditions of Lemma 4.3 hold and thus Fitch’s algorithm find all most 
parsimonious labelings for y on B. 


We first explore the Fitch solutions for ye,, j € [¢]. 


Fact 4.13. Fix j € |t]. There is only one ¢ € L(B) with ye,(t) = 1. After running 
Part 1 of Fitch’s algorithm, B(@) = {1}, the parent v of € has B(v) = {0,1}, and 
B(u) = {0} for all other vertices. Consequently, Part 2 of Fitch’s algorithm will 
output a most parsimonious labeling Le, such that Le, (€) = 1 and for all other vertices 


uéEV(B), v,(u) =0. 


Proof. These values of B follow directly from the description of y(@)[e;], for leaf 2, 
which was given in Definition 4.12. The conclusion follows from the definition of y’ 


(4.3). = 


Fact 4.14. For j € |t], there is only one most parsimonious labeling Le; which extends 


leaf labeling (pe, of B. 


Proof. Recall that most parsimonious labelings minimize the sum of Hamming dis- 
tances between adjacent vertices in the tree. The most parsimonious labeling obtained 
from Fitch’s algorithm has 
~~ Av, (u), ¥2,(v)) = 1. 
uve E(B) 
Because there is only one leaf ¢ with y.,(¢) = 1, the y, obtained from Fitch’s 
algorithm is the only extension of y,, with the sum of Hamming distances equal 


1. | 
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Fix j € [n]. Next we consider the most parsimonious labelings for y,, on B. The 
same arguments will hold for each yy,. 
Run Part 1 of Fitch’s algorithm on B with leaf labeling y,,. For those clauses c¢; 


which contain variable v;, we have the following result. 


Proposition 4.15 (Miklés, Kiss, and Tannier (2014)). Fix a clause c;. Suppose 
variable vu; is in c; with coordinate x; corresponding to variable v;. Let r' be the root 
of unit subtree U; for c;. Run Fitch’s algorithm on U; with leaf labeling p2,;. The 
following hold: 


1. B(r*) = {0,1}. 


2. Foru,v € V(U;), if v is a child of u, then B(v) = {0,1} > B(u) = {0,1}. 


In a single copy of S(a, b,c), all vertices of the same distance from the root either 
have B(v) € {{0}, {1}} or all of them have B(v) = {0,1}. This fact together with 
Proposition 4.15 implies that, when Fitch’s algorithm is run on 6 with leaf-labeling 


y(x;), for any u,v € V(T) with v a child of u, 
Biv) ={0.1 pS Bw) = {0,1}. 


With this result and the structure of each S(a,b,c), By Lemma 4.3, we can con- 
clude that Fitch’s algorithm finds all most parsimonious labelings of B that extend 
Ye; Further, B(p) = {0,1} implies there are exactly two such most parsimonious 
labelings. 

As mentioned earlier, these results also hold for coordinate y;. Fitch’s algorithm 
finds the only two most parsimonious labelings that extend y,, on B. 

For most parsimonious labeling y’ that extends y, on each v € V(T), notate 


y'(v) [xj] by y,. Likewise, define the notations vy, and ¢%,. 
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Lemma 4.16. For leaf labeling y of B, Fitch’s algorithm finds all most parsimonious 
labelings. Each is characterized by the string it assigns to the root p of B and there are 


precisely 2°" most parsimonious labelings, one for each root label in {0,1}?" x {O}". 


Proof. Given a most parsimonious labeling y’ that extends vy, each Pr, Pus and yt, 
is a most parsimonious labeling for that coordinate. So it suffices to first find all most 
parsimonious scenarios for the leaf labelings Yz,,y,,e, for all 7 € [n] and s ¢€ [{t] 
and take combinations of these labelings. 

We have already seen that Fitch’s algorithm will find all most parsimonious la- 
belings for y,, and y,,, and there are exactly 2 of each. Fitch’s algorithm will also 
find the one and only most parsimonious labeling for y.,. Therefore, there are 2?” 
most parsimonious labelings of B that extend y. Part 2 of Fitch’s algorithm shows 
that each most parsimonious labeling is characterized by the string it assigns to p. 
Since B(p) = {0,1} for each yz, and yy, and B(p) = {0} for each y-,, the possible 


strings for y/(p) are {0,1}?" x {0}. | 
Set M <= {0,1}?" x {0} 


Definition 4.17. There is a bijection between M and the possible truth assignments 
for W(I’). In particular, given any u € M, define a truth assignment for variables 


{uj}, U {wi}, as follows: 
e For eachi € [nl], let vu; be assigned the value true if p[x;] = 1 and false otherwise. 
e For eachi € |n], let w; be assigned the value true if |y;| = 1 and false otherwise. 


Define Myr) to be the set of uw € M which correspond to satisfying truth assignments 
for W(I). Likewise, for any O, a clause or conjunction of clauses from W(I’), define 


Meo to be the set of up € M which correspond to satisfying truth assignments for O. 


Now we know that each most parsimonious labeling of B extending y is found 


using Fitch’s algorithm and is characterized the binary string it assigns to the root. 
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From here, we are interested in the number of SCJ scenarios admitted by each of these 
most parsimonious scenarios. Ultimately, we wish to make a distinction between the 
binary strings in Myr) and those in M \ Mur) by examining the number of SCJ 
scenarios admitted by the corresponding most parsimonious labeling. 

Let y’ be a most parsimonious labeling for B. The number of scenarios which are 
admitted by y’ is precisely 

H(y'(e)) = TT Aly), ¢(e))!. 
uve E(B) 

To calculate this, we partition the edges of B into 4 sets. 

First, consider the edges of the comb which connects Lean to form 6. Part 2 
of Fitch’s algorithm will set y’(p) = y'(p') where p' is the root of B;. So the Hamming 
distance along each of these edges is 0. 


Next we look within each B;. 


Claim 4.18. Set d:= A\@,. Fori € [k+A4nl, let p' be the root of B; with children p', 


and pi.. Set n:= y'(p"). Ifn € Mg, then 
A(n, ¢'(pt)) = H(n, ¢'(p')) =n - 3. 


Otherwise 
(n— 3)? < H(n, p'(p2))! - H(n, ¢'(e;))! < (2n — 6)!01. 


Proof. Suppose n € Mj. Then for each j € |[n] considered in Step 2 of Definition 4.11 


(there are n — 3 such j) , if y[x;] = 0 then we have the following properties: 
e 7|y;| = 1 because 7 corresponds to a satisfying assignment for &, 
e 0=n[x)] 4 ¥'(o,)[z5] = 1, 
e 1 = nly] 4 ¥'(r%) [yi] = 0. 


On the other hand, if n[z;] = 1 then we have the following properties: 
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e 7|y;| = 0 because 7 corresponds to a satisfying assignment for &, 

e 1=n[2,] 4 o'(o) [x5] = 0, 

© 0= nly) 4 ¥'(o,)[ys] = 1. 
For each s € [t], nles}] = ¥'(p)[es}] = (pi) [es] = 0. For each 7 € [n] which was 
not considered in Step 2 of Definition 4.11, n[x;] = y'(e%)[z;] = y’(p%)[x,] and 


niyil = ¢'(er)[ys] = ¢'(')[y;] because the B values (from Fitch’s algorithm) for 


these coordinates at these vertices will be {0,1}. Thus 
A(n, ¢'(o0)) = H(n, ¢'(o,)) = 0 - 3. 


Alternatively, if 7 ¢ Mo, then H(n, y'(p)) + H(n, y'(p1.)) = 2n — 6 because each 
x; and each y; will contribute 1 to one of the Hamming distances. Using the convexity 


of the factorial, this establishes the last line of the claim. rT] 


Based on the construction of S(a, b,c), the Hamming distance H(y’(u), y’(v)) for 
each edge uv in each copy of S(a, b,c) is exactly 1. 

The only piece remaining is 7;. We make the following remarks for the clause 
Cc; = Uz V v2 V v3 to make the explanation easier. However, the arguments can be 


extended for any clause c; in W(J’). 


Fact 4.19. If t' is the root of J; andr‘ is the root of one of the copies of U; below Ti, 


then running Fitch’s algorithm for each coordinate, we find 
e B(t') = B(r*) = {0,1} for each x;, 7 € [3], by Proposition 4.16. 
e B(t') = B(r*) = {0} for each x;, 1 > 4, by the construction of B;. 


e B(t') = B(r*) = {0} for each y;, i € [n], by the construction of B;. 


e 
= 
$ 


B(t') = B(r’) = {0} for each es, s € |t], because there is only one leaf £ € L(B) 
with p()les| = 1. 
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Therefore, it is easy to see that, along the edges of the comb which connect the 
copies of U;, the Hamming distances will be 0. 


Next we turn our attention to a single copy of U4, say the 7" copy. 


Fact 4.20. Fix I and build binary tree B. Fix a most parsimonious labeling vy’ 
which extends leaf labeling yp. For clause c; = v1 V v2 V v3, we have the following 


characteristics for each v € U,, 
© fors>4, ov) [x] =0, 
e for s € [nl], y'(v)[ys] = 0, 


e fors ¢ E,;, y'(v) les] = 0. 


Therefore, only the values of y’(v) on the coordinates 71, 72,73 and e, for s € Ej; 
will affect the Hamming distances along the edges in U;. These are precisely the 
151 coordinates that appeared in the originally labeling ¢ of the leaves of Uj; given by 
Miklés, Kiss, and Tannier (2014). For each v € V(y;), define ¢’(v) : V(U;) + {0,1}2! 
to be the restriction of y’(v) to these 151 coordinates. In particular, ¢’ is a most 
parsimonious labeling on U/; which extends leaf labeling ¢. 


The following fact is a consequence of Fact 4.20. 


Fact 4.21. Let r’ be the root of U;. If y'(r*) = ¢'(r*), then for each uv € E(U;), 


As a result 


II 4’), ¢'))!= TT #4), eo) 


uveli; uveli; 


This is calculated as follows: 


Fact 4.22 (Miklés, Kiss, and Tannier (2014)). Fix i € [k +4n], the binary tree U; 
with root r', and leaf-labeling 6. Then for any most parsimonious labeling @' which 


extends ~: 
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1. If @'(r*) corresponds to a satisfying truth assignment for c;, then 


I] 2 (u), (w))h = 2 x 3. 


uveli; 


2. If @'(r*) corresponds to a truth assignment which does not satisfy c;, then 


I] 2G '(u), Bw)! = 2 x 3%. 


uveli; 


Since y'(p) = y'(r’) = ¢'(r"), v'(p) corresponds to a satisfying truth assignment 
for c; if and only if ¢’(r*) also corresponds to a satisfying truth assignment for c;. 


As a result of the above discussion, we have proven the following claim. 


Claim 4.23. Fixi € [k + 4n]. If y'(p) corresponds to a satisfying truth assignment 


for clause c and Netn} 2c; then 


I] Fy), ¢'())! = - 3)? (oy y Se 


uve B(B;) 
If y'(p) corresponds to a truth assignment which does not satisfy c;, then 


16n?+8kn 
(rn — 3)? (29% x a)" < TT (yu), o'(v))! 
uve E(B;) 


16n?+8kn 
< (2n = 6)! (213 x 37) 


If y'(p) corresponds to a truth assignment which satisfies c; but does not satisfy 


Mie{n| Pi, then 


(n — 3)!2 (2196 x 3)" © TT H(y'(w),¢'())! 


uve E(B;) 


16n?+8kn 
< (2n — 6)! (2 x 3) 
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Observe, 


(Qn — 6)! [2136 x g76]16r7+8kn— P3127 161+ 8kn , 

(n — 3)!2 [2156 x goa er een > Fs (: ae) 
312 saat 

=| 

_ 16n?+8kn 


920 


x 


g2ntk 


x 


16n2+8kn 
z saree 


312 | 16n?+48kn 


x 


919.5 


x 
= 


Consequently, 


2 
3) 16n*+8kn 


(2n — 6)! (2'8° x ae) a icy bal 


< (2n — 6)! (2158 x 364) 26n?+8hn 


Claim 4.24. If y'(p) corresponds to a satisfying truth assignment for W(I’), then 


16n248kn] k+4n 


H(y'(p)) = (n = 3) Cas % 3”) 


=: Basa: 


If y'(p) corresponds to a truth assignment which does not satisfy W(I’), then there 
must be a clause c; for some i € |k + 4n| which is not satisfied. Therefore, 


H('(p)) <(2n — 6)! (218 x 376)" 


16n2 +8kn k+4n-1 


: (an — 6)! (a x oo) 


=:Boaa- 


Define 
Brotal += os H(y'(p)) 
gy! 
which is the total number of most parsimonious SCJ scenarios for B which extend 


leaf labeling y, as in Definition 1.22. 
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Given only Biotai, we would like to determine the number of satisfying truth 


assignments, |S|, for W(I’). 


Brotal = os H(n) a ee H(1’) 


nEM yr) EM\M yr) 


=|S|Bgoout D>  H(7/). 


NEM\MG (py 


As long as Le EM\ M's ry H(17') < Bgooa, We can conclude that the number of satisfying 


truth assignments for W(I°) (and for I’) is precisely 


| 
Boeed 
Observe, for n > 2, 
92n Bes oe 312 16n?+48kn cae k+4n 
Bisa 220 n—3 
312 16n?+8kn 
< gan Fs g2n(k+4n) 


920 


24 


24 


920 


312 16n?+8kn 
= a 1/2 | 


<b 


Because there are only 2?” truth assignments and 2?” most parsimonious labelings, 
we obtain our desired result: 
YA) Ss So Bits 2 Bia Boor 
n' CU n'€U 
Therefore, if we could determine the total number of most parsimonious scenarios 
for this binary tree in polynomial time, then we could obtain the total number of 
satisfying assignments for W(I”) and for I’ in polynomial time. This completes the 


proof. rT 
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CHAPTER 5 


ECCENTRICITY SUMS IN TREES 


The eccentricity of a vertex v in a connected graph G is defined in terms of the 
distance function as 


= d : 
eccg(v) eS (u,v) 


The radius of G, rad(G), is the minimum eccentricity while the diameter, diam(G), 
is the maximum. The center, C(G), is the collection of vertices whose eccentricity is 
exactly rad(G). 

We focus our attention on trees, where the center has at most two vertices (Jordan 
1869) and the diameter is realized by a leaf. We also explore the total eccentricity of 
a tree T’, defined as the sum of the vertex eccentricities: 

Ece(T) := S > ecer(z). 
z€V(T) 

For a fixed tree T with v € C(T) and any z € V(T), 


Ecc(T’) 
ueL(T) eccp(u) 


Ecc(T) 
ecer(z) 


Ecc(T) 
eccr(v) 


- < 


where L(T’) denotes the leaf set of T. This motivates the study in Section 2 of 
the extremal values and structures for the following ratios where u,w € L(T) and 


vEC(T), 
Ecc(T)  Ecc(T) — ecer(u) id eccy(u) 
ecer(v)’ ecer(u)’— ecer(v)’ ecer(w) 


The results are analogous to similar studies on distance in (Barefoot, Entringer, and 


Székely 1997) and on the number of subtrees in (Székely and Wang 2014; Székely and 
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Wang 2013). As in those papers, the behavior of ratios is more delicate than that of 
their numerators or denominators. 

For a graph with n vertices, the total eccentricity is n times the average eccen- 
tricity. Dankelmann and Mukwembi (2014) gave sharp upper bounds on the average 
eccentricity of graphs in terms of independence number, chromatic number, domi- 
nation number, as well as connected domination number. For trees with n vertices, 
Dankelmann, Goddard, and Swart (2004) showed that the path maximizes Ecc(T). 
In Section 3, we prove that the star minimizes Ecc(T) among trees with a given 
order. Turning our attention to trees with a fixed degree sequence, we prove that 
the “greedy” caterpillar maximizes Ecc(T’) while the “greedy” tree minimizes Ecc(T)). 
This provides further information about the total eccentricity of “greedy” trees across 
degree sequences. 

From here forward, we assume that T is a tree with n vertices. Given two vertices 


a,b€ V(T), P(a,b) will be the unique path between a and 0 in T. 


5.1 EXTREMAL RATIOS 


In this section, we consistently use the letters u,w to denote leaf vertices while v is 
a center vertex. Before delving into ratios, the following observation from Jordan 
(1869) is given without proof, and will be used many times. The next observation is 


a simple calculation which will be useful in our proofs. 


Observation 5.1. The center, C(T), contains at most 2 vertices. These vertices are 
located in the middle of a mazimum length path, P. If {v} = C(T), v divides P into 
two paths, each of length rad(T). If {v,z} = C(T), the removal of vz € E(T) will 
divide P into two paths, each of length rad(T) — 1. 
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Observation 5.2. For any path P with y edges and y +1 vertices, 


3 . . 
Sy +y if y is even 
KectP).=: - So .ecen zy = : 
zE€V(P) 


Sy? +y+4 if y is odd. 


5.1.1 ON THE EXTREMAL VALUES OF 


Eee(T), WHERE v € C(T) 


eccy(v) 
Theorem 5.3. Let T be a tree with n > 2 vertices. For any v € C(T), we have 


Ecc(T) 
ecer(v) 


< 2n—-1. 
For n > 3, equality holds if and only if T is a star centered at v. 


Proof. Let T be an arbitrary tree with v € C(T). It is known that for any tree T, 
diam(T) < 2rad(7’) and for any vertex z € V(T), rad(T) < ecer(z) < diam(T). 


Because ecer(v) = rad(T’), the bound in the theorem is proved as follows: 
Ecc(T) < ecer(v) + (n — 1) diam(T) < (2n — 1) rad(T). 


Equality holds precisely when T has eccr(z) = 2eccr(v) for all vertices z # v. 
Because the eccentricities of adjacent vertices differ by at most 1, ecer(v) = 1 and 


eccr(z) = 2 for all z £ x which is only true for the star. = 


Theorem 5.4. Let T be a tree with n > 2 vertices. Let k andi be nonnegative 


integers withO<i<2k andn=k? +1. For any v € C(T), we have 


Bee(T) nm—-34+2k+2 if0<i<k 


eccr(v) 


n—-3+2k+ 7-5 ifkt+1<i< 2k. 


For n > 4, equality holds if and only if T is a tree whose longest path has 2x vertices 
(x =k in the first case and x = k+1 in the second) and each other vertex is adjacent 
to one of the two center vertices of this path. For1 =k, the two bounds agree and 


both values for x provide an extremal tree. 
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Proof. Let T be a tree with n > 8 vertices and let v € C(T). If T is a star then 


Ecc(T’) 
eccy(v) 


= 2n — 1 which is strictly greater than the bounds in the theorem. 

For the remainder of the proof, we consider the case when J’ is not a star. By 
Observation 5.1, there is a maximum-length path P := P(u,w) with v in the middle 
and d(u, v) = eccr(v). We now consider two cases, based upon the size of C(T). 

If C(T) = {v}, then both P(u,v) and P(v, w) have length eccr(v). Let S' be the 
non-empty set {w’ € L(T) : w’ € uand d(v,w’) = eccr(v)}. Create a new tree F 
from T by detaching each leaf w’ € S and appending each one to v. This tree is 
different from T because T was not a star. For any z € V(T), ecer(z) > eccr(z). 
Further, for each w’ € S, eccr(w’) > eccr(w’). As a result, Ecc(T) > Ecc(F’). As for 
v € C(T), eccr(v) = eccr(v) = dr(u,v) because u ¢ S. The length of the longest 


path in F is one less than the length of the longest path in T’ which implies v € C(F’) 


and |C(F)| = 2. Altogether, we see ee > ene Hence, to minimize ee it 
suffices to consider those trees with two center vertices. 

Suppose |C(T)| = 2 and let x := eccr(v). Here, the path P has length 2x — 1 
and the vertices on P realize their eccentricities along this path since it has max- 


imum length. Explicitly calculating the eccentricities of the vertices on P, using 


Observation 5.2, and lower bounding all other eccentricities by x + 1, we have 


Bool?) 5 ¥ ((aet 2) +(r-20 +1) = + (0-9) 


Equality holds if and only if each vertex not on P is a neighbor of one of the center 


vertices of P, as in Fig. 5.1. 


U U 
o——_e- - - - + e eo ----e e 
a—1 vertices oi ae on: x—1 vertices 
eS 


n—22x vertices 


Ecc(T’) 
eccr(v)* 


Figure 5.1 A tree which minimizes 


To determine the value of x which minimizes f(), we use the first derivative test. 
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Because f'(x) = 1— +5 is negative for x < \/n and positive for 7 > \/n, the minimum 


of f(x) is obtained when 

x € {| vn], [vn]} {kk +1}. 
Because f(k+1)—f(k) = iaeie f(k) < f(k+1) precisely when i > k with equality 
when 7 = k, as stated in the theorem. rT] 


Ecc(T) 
eccr (wu) 


5.1.2 ON THE EXTREMAL VALUES OF 


WHERE wu € L(T) 


Theorem 5.5. Let T be a tree on n > 8 vertices. Let k andi be integers with 


0<i<2k and2n—1=k? +i. For anyu€ L(T), we have 


Bec(T) 2n+1-2k-4 if0<i<k 


eccr(u) 


ial Sh ap ae tS 2k. 


Equality holds if and only if T is a tree with longest path P = 2122... 222-1 (C= k 
in the first case and x = k +1 in the second), leaf u adjacent to zz, and each other 
vertex adjacent to either z or Zoz-9. Fori =k, the two bounds agree and both values 


of x will provide an extremal tree. 


Proof. Let T be a tree with n > 8 vertices and u € L(T). If T is a path, then 


ma < 3n 5 which is strictly smaller than the bounds in the theorem. 


For the remainder of the proof, we will suppose T is not a path. Fix P := P(w,w’) 


to be a maximum-length path in 7. For any leaf u € L(T) different from w and w’, 


Ecc(T’) Ecc(T) 
eccy (w) = eccy(u) 


. Because we are interested in an upper bound, it suffices to consider 
leaves u which are not on P. 

Let u be a leaf of T which is not on P and let x := eccr(u). There is a unique 
path from u to the closest vertex on P, say z. Then d(u,w) = d(u, z) + d(z,w) and 
d(u, w’) = d(u, z) + d(z, w’). Since d(u, w) and d(u, w’) are at most x and d(u, z) > 1, 


we have d(w, w’) = d(w, z)+d(z, w’) < 2x—2. Because P has maximum length, every 
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vertex on P realizes its eccentricity along P. Every vertex not on P has eccentricity 


at most 2% — 2. This gives the upper bound 


ae x . (« (Fee — 2)? + (2x — 2)) +(n — 2x) (2x — 2)) 


—x* + (2n+1)x — (2n — 1) 


i 
Equality is achieved precisely when T is a tree with longest path P on 2x —1 vertices, 
u is adjacent to the middle vertex of P, and all other vertices have eccentricity 2x — 2. 


Such a tree T is shown in Fig. 5.2. 


ie 
{v}=C(T) 
NE NZ u v w 
o——_‘« - - - ° + eo --- © e e----+ ° eo --—-- e 
—— Vi 
2x—3 internal vertices ae 


n—x—1 vertices 


Figure 5.2 A tree (left) which maximizes ae and a tree (right) which 
minimizes moe. 


It remains to determine the value of x that will maximize a for trees with 


eccy( 


the structure described above. The first derivative test shows that f(x) is maximized 


when 


x € {|V2n—1| ,|V2n—1]} C {kk + 1}. 
The larger of f(k) and f(k+ 1) gives the appropriate upper bound in (5.5). In addi- 
tion, we must require 27 < n in order to have a realizable tree. One can individually 
check that this is the case for n € {8,9,...,12}. When n > 13, we have k > 5 in 


which case 0 < k? — 4k — 3 which implies 2x2 < 2(k +1) <n. | 


Theorem 5.6. Let T be a tree of ordern > 5. Let k andi be nonnegative integers 


with 0 <i< 2k and 4n—4=k? +i. Then for any leaf u, 


root) [P+$+e — itkiseve a 
eccr(u) ~ | : fats . 
+8 Tea if k is odd. 
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Equality holds if and only if T is a tree with longest path P of length 2x (2x =k for 
the first case and 2x = k +1 for the second) with all other vertices adjacent to the 
middle vertex of P as shown in Fig. 5.2. Wheni =k, both bounds in (5.1) give the 


same value and both give extremal structures. 


Proof. Let T be a tree and u € L(T). Let x := eccr(u) and choose w € L(T) so 
that d(u,w) =x. Let P := P(u,w). The vertices on P have eccr(u) > eccp(u). The 
eccentricity of any vertex not on P is at least 1+ 5 with equality if x is even and 
these vertices are adjacent to the center vertex of P. This gives the following lower 


bound: 


Ecc(T’) > 1 (52 r) + (n—x—1) (1+5)) =: f(z) 


ecer(u) — x \\4 2 


where equality holds when P has even length and all vertices not on P are adjacent 
to the center vertex of P as in Fig. 5.2. Examination of f(a”) shows that the ratio is 


minimized when 


x € {|V4n—4] ,|V4n—4]} C {kk + 1}. 


We already established that the lower bound is tight for even x. For the universal 
lower bound, we let x = k if k is even and k+1 otherwise. Both will yield a realizable 
tree because x < n—1 for n > 5. It is also important to note that if 4n —4 is a 
perfect square, then k = |\/4n —4| = [/4n —4] = 2\/n —1, an even value. The 
lower bounds in (5.1) are exactly f(k) and f(k +1). For thoroughness, it can be 
verified that f(k) < f(k +2) and f(k+1) < f(k —1), for k > 1 to show that our 
choice of the even integer nearest /4n — 4 was correct for this concave up function. 

For thoroughness, it is shown below that f(k) < f(k+2) and f(k+1) < f(k-1), 
for k > 1. (One can quickly check that the bound also holds for n = 3 when k = 1.) 
Thus our choice of the even integer nearest /4n — 4 was correct for this concave up 


function. 
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fk+2)-f(k) = D5+5-5 
2k(n — 1) + k(k + 2) — 2(k + 2)(n — 1) 
2k(k + 2) 
k? + 2k — (4n — 4) 
2k(k + 2) 
k? + 2k = (k? 43) 
2k(k + 2) 


IV 
ro) 


—(k—1)(k+1) + 2(n— 1)(k +1) —2(n—1)(k-1) 
2(k —1)(k +1) 

—k? +1+4+4(n—-1) 

2(k —1)(k +1) 

—k?+1+k? +i 

2(k —1)(k +1) 

i+] 
2(k —1)(k +1) 
> 0. 


5.1.3 ON THE EXTREMAL VALUES OF %2\) WHERE u € L(T) , v € C(T) 


eccr(v) 
Theorem 5.7. Let T be a tree on n > 3 vertices with u € L(T) andv € C(T). Then 


eccr(u) 


<2 
ecer(v) ~ 


where the upper bound is tight for stars, even length paths, and more. If, in addition, 


n> 5, then 


Equality holds if and only if T is one of the following trees: (1) For anyn > 5, T has 
a longest path P onn —1 vertices with a single verter u adjacent tov € C(P). (2) 
For evenn > 5, T has a longest path P on n—2 vertices with u adjacent to v € C(P) 


and w adjacent to any internal vertex of P. These structures are drawn in Fig. 5.8. 


Proof. The upper bound of 2 follows from the facts that rad(Z) = ecer(v) and 
ecep(u) < diam(T’) < 2rad(7’). This bound is tight for all trees whose maximum- 
length path has an odd number of vertices and u a leave of one of these paths. 

Turning our attention to the lower bound, let T’ be a tree with n > 5 vertices. We 
first show that it holds for paths. If 7’ is a path, then 


CO i WE fe Re ie 1 
eccr(v) ~ 4 ian ai eat 


For the remainder of the proof, we assume T is not a path. Because n > 5, 
ecer(u) > ecer(v) + 1 with equality exactly when uv € E(T). In addition, because 
v € C(T), Observation 5.1 guarantees a maximum-length path P with v in the 
middle. Because T is not a path, P has at most n — 1 vertices and ecer(v) < ["5*]. 
These two inequalities result in the desired bound. 


ecer(w) are 1 ie 1 
Ea = 


eccr(v) ecer(v) 

Finally, let us analyze the trees T’ for which equality holds. Because P has at 
most n — 1 vertices, we first examine the necessary and sufficient conditions to have 
ecer(v) = ["57], based on the parity of n. If n is odd, then eccr(v) = "5* if and 
only if P has n — 1 vertices. For even n, ecer(v) = Le if and only if P has n — 1 or 
n — 2 vertices. 

Therefore, the bound in the theorem is tight exactly when T is one of the following 
two trees which are drawn in Fig 5.3: (1) T is a tree with longest path P on n—1 
vertices and leaf u adjacent to v € C(P). (2) For even n, T is a tree with maximum- 


length path P = 2129...2Zn_2 with u adjacent to v € C(P) and w adjacent to 2; for 


some i € {2,3,...,n— 3}. 2 
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U U 
e e e 
U U 
C o-----~ ©. o o-----+ ry °. o---- ° ° o o-----~ ©. ry 
Ce eS SS SS ———— 
[71] vertices — vertices 1S vertices at vertices 
eccr(u 


Figure 5.3 Trees which minimize en) the right one for even n only. 


eccy (wu) 
eccrp(w) 


5.1.4 ON THE EXTREMAL VALUES OF WHERE u, w € L(T) 


are reciprocals of 


First note that since the maximum and minimum values of cer (a 


eccr(w) 


each other, we only consider the maximum. 


Theorem 5.8. Let T be a tree with n > 4 vertices. For any u,w € L(T), we have 


eccr (wu) Bip. 
ecer(w) — 


N13 
os 


For even n, equality holds if and only if T ts a tree with longest path P = uzz3...Zn—1 
and leaf w adjacent to 2/2. For odd n, equality holds if and only if T is a tree with 
longest path P = u223...2%n—2, leaf w adjacent to 2n~1)/2 and leaf w adjacent to z; 


for some i € {2,...,2—3}. These constructions are drawn in Fig. 5.4. 


Proof. Let T be a tree and let u,w € L(T). For the upper bound, it is reasonable 


eccr (wu) 
eccr(w) 


to assume eccr(u) > eccr(w). If d(u,w) = ecer(u), then = 1| which is strictly 
smaller than the bound in the theorem. 
For the remainder of the proof, we focus on the case where d(u,w) < eccr(u). 


Choose y € L(T) so that eccr(u) = d(u,y) and let P := P(u,y). There is a unique 


path from w to the nearest vertex, say z, on P. Thus 


eccer(w) > d(w, z) + max{d(z,u),d(z,y)} >1+ E ecer(u)| 


where equality holds if d(w,z) = 1 and |d(z,u) — d(z, y)| < 1. We now consider two 


cases based on the parity of eccr(u). 
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First suppose x := eccr(u) isodd. Let S be the collection {y : d(u, y) = x}. Notice 
that w is not in S because d(u, w) < x. Now create a new tree F' from T by detaching 
each y € S and reattaching each as a pendant vertex adjacent to the unique neighbor 


of uin T. As a result, eccr(u) = x — 1, an even integer. By the above argument, 


ecep(w) > 1+ [2] =1+4 (x41) while eccp(w) > 1+ [Feccr(u)] = 1+ $(x — 1). 


eccr (u) and eccp(u) < tal The 


eccr(w) — 4 (a+3) eccr(w) — 5(x+1)° 


As a result, we obtain tight upper bounds 
second gives the larger upper bound. Since we seek a tight universal upper bound for 
the ratio, it suffices to consider only trees with u having even eccentricity. 

Assume eccy(u) is even. If n is even, then eccr(u) < n — 2 because w is not on 
P. However, we can tighten this to eccr(u) <  — 3 when n is odd because of our 
assumption about the parity of ecer(u). In either case, 1+ }ecer(u) < [8]. This 
give the desired bound: 


eccr(u) 2 ecer(u) 2 » 2) 


ecep(w) ~ 1+ }ecer(u) = 1+ Fecer(u) i 

Finally, we characterize the trees T’, based on the parity of n, for which equality 
holds. For even n, equality holds if and only if J is a tree with longest path P on 
n — 1 vertices with leaf w adjacent to the center of P. For odd n, equality holds if 
and only if T is a tree with longest path P on n — 2 vertices with leaf w adjacent to 


the center of P and leaf w adjacent to any internal vertex of P. This is exemplified 


by Fig. 5.4, with the additional leaf that occurs only for odd n in gray. a 
WwW e 
pes eo ----~- ae ----+4 y 


2 eal —3 internal vertices 


eccr (wu) 


Figure 5.4 A tree which maximizes ent 


Table 5.1 summarizes the results in section 2. Vertex labels appear as in the 


theorems. Specifically v is always in C(T) while each u and w are leaves of T. 
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Table 5.1 A summary of results for an arbitrary tree T on 
n vertices with v € C(T) and u,w € L(T). 


Bound Extremal Tree 


U 


< In — 2/In + b(n) Rr 


2V/2n—3 internal 


Stars, even length 
paths with pendant 
edges, etc. 


The quantities a(n), b(n), c(n) are bounded as follows: 1 < 
a(n) <5, -1 < b(n) <5, 0 <e(n) < 3. 


5.2 EXTREMAL STRUCTURES 


In this section, we fix a class of trees and find the ones in this class that maximize 
Ecc(T) and the ones that minimize Ecc(T). First, we consider the trees on n vertices. 
Then, we fix a degree sequence and search in the class of trees that realize this degree 


sequence. 
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5.2.1 GENERAL TREES 


For many indices, such as the sum of distances and the number of subtrees, the star 
and the path are extremal. Dankelmann, Goddard, and Swart (2004) showed that 
the path maximizes Ecc(7’) among trees with given order. We show that the star 


minimizes Ecc(T’) among trees with given order. 
Proposition 5.9. For any tree T with n > 2 vertices, 

Ecc(T) > 14+ 2(n-—1) = 2n-1 
with equality if and only if T is a star. 


Proof. Any tree with at least three vertices has at most one vertex which is adjacent 


to every other vertex (hence with eccentricity 1). Thus we have 
Ecc(T) > 1+ 2(n —1) = 2n—-1. 


Equality holds if and only if the single center vertex has eccentricity 1 and all other 


vertices have eccentricity 2. This characterizes the star. rT 


5.2.2. TREES WITH GIVEN DEGREE SEQUENCES 


Given a degree sequence, let 7 be the class of trees that realize this degree sequence. 
We determine which trees in 7 have total eccentricity equal to minpez Ecc(T) or 
max7e7 Ecc(T). We note that a sequence (d1, d2,...,dn) is the degree sequence for 


a tree if and only if 0%, d; = 2(n — 1) and each d; is a positive integer. 


General Caterpillars 


Among all trees with a given degree sequence, the sum of distances is maximized by a 
caterpillar (Zhang, Xiang, Xu, and Pan 2008) and the number of subtrees is minimized 


by a caterpillar (Sills and Wang 2015; Zhang and Zhang 2015). However, completely 


102 


characterizing the extremal caterpillar turns out to be a very difficult question in both 
cases. For the sum of distances, it is a quadratic assignment problem that is NP- 
hard in the ordinary sense and solvable in pseudo-polynomial time (Cela, Schmuck, 


Wimer, and Woeginger 2011). 


Definition 5.10 (Wang (2014)). For n > 3, let d = (d,,dy,...,dn) be the non- 
decreasing degree sequence of a tree with dy, > 1 and dyy1 = 1 for some k € [n — 2]. 


The greedy caterpillar, T, 1s constructed as follows: 
e Start with a path P = z,2... Zp. 


e Let bd: {a}%, > {d;}*_, be a one-to-one function such that, for each pair 


1,9 € [k], of eccp(z;) > ecep(z;) then b(%) > O(z;) - 


e For each i € {2,3,...,k — 1}, attach $(z;) — 2 pendant vertices to z;. For 


i € {1,k}, attach o(z;) — 1 pendant vertices to z;. 


Fig. 5.5 gives two examples of greedy caterpillars and highlights the fact that 


greedy caterpillars are not unique. 


AMAA MAAS 


Figure 5.5 Non-isomorphic greedy caterpillars for degree 
sequence (7,6,5,4,4,1,...,1). 


Proposition 5.11. Among trees with a given tree degree sequence, the greedy cater- 


pillar has the maximum total eccentricity. 


Proof. Fix a degree sequence d = (d,,...,d,) which is written in the form described 
in Definition 5.10. Let 7 be the collection of trees with degree sequence d. Let T € T 


be a tree such that Ecc(T) = maxpez Ecc(F’). We first show that T is a caterpillar. 
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For contradiction, suppose T is not a caterpillar. Let Pp(u,v) = wuju2...u,zv be 
a longest path in T. Let x € |k]| be the least integer such that u, has a non-leaf 
neighbor w not on Pr(u,v). Because P is a maximum-length path, x 4 1. Let W be 


the component containing w in T — {u,w}. 


a+1 Uxr+2 


ar AARA KK” 


Figure 5.6 Generating 7’ from T for the proof of 
Proposition 5.11. 


Create a new tree 7” from T by replacing each edge of the form zw in W with the 
edge zu. (Fig. 5.6). Notice that T and T”’ have the same degree sequence. However, 
for any vertex s € (V(T) \V(W)) U {w}, ecer(s) > eccer(s) because Pr(u,v) is a 


longest path in T. For any vertex r € V(W) — w, we have 
ecer(r) = d(r,u) + d(u,v) > d(u,v) > ecer(r). 


Thus Ecc(T") > Ecc(T), which contradicts the extremality of T’. 
Since T' is a caterpillar with internal vertices forming path P = uju2g...uz, the 
eccentricity of any internal vertex is independent of the interval vertex degree assign- 


ments. For any 7 € |k| and leaf w adjacent to uj, 
eccr(w) = max{k —i,i-—1} +2. 
If @: {u;}*_, + {d;}*_, is a one-to-one function, then when k is even, 
k 
Eec(T) =) ecer(ui) + (b(t) + O(ux))( + 1) + (P(u2) + O(tte-1))(&)+ 


+ ( (ue,2) + o(u (k+2 \/2)) (k/2 + 2). 
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In order to maximize the total eccentricity, for i,7 € [k], if 7 is closer to k/2 than 
i, then we should have ¢(u;) > ¢(u;). It is a greedy caterpillar which achieves this. 


The case when k is odd is similar. r 


Greedy trees and level-greedy trees 


In this subsection, each tree is rooted at a vertex. (While the root has no bearing 
on the total eccentricity, we use the added structure to direct our conversation.) 
The height of a vertex is the distance to the root and the tree’s height, h(T), is the 


maximum of all vertex heights. We start with some definitions. 


Definition 5.12 (Schmuck, Wagner, and Wang (2012)). In a rooted tree, the list of 
multisets L; of degrees of vertices at height i, starting with Lo containing the degree 


of the root vertex, is called the level-degree sequence of the rooted tree. 


Let |L;| be the number of entries in L;. It is easy to see that a list of multisets 
is the level degree sequence of a rooted tree if and only if (1) the multiset U; L; is 
a tree degree sequence, (2) |Zo| = 1, and (3) Suez,d = |£i|, and for all i > 1, 
diden,(d— 1) = |Lesil. 

In a rooted tree, the down-degree of the root is equal to its degree. The down 


degree of any other vertex is its degree minus one. 


Definition 5.13 (Schmuck, Wagner, and Wang (2012)). Given the level-degree se- 
quence of a rooted tree, the level-greedy rooted tree for this level-degree sequence is 
built as follows: (1) For each i € |n], place |L;| vertices in level i and to each vertex, 
from left to right, assign a degree from L; in non-increasing order. (2) Fori € |n—1], 
from left to right, join the next vertex in L; whose down-degree is d to the first d so 


far unconnected vertices on level L;4,. Repeat for i+ 1. 


Definition 5.14 (Wang (2008)). Given a tree degree sequence (d, dz,...,d,) in non- 


increasing order, the greedy tree for this degree sequence is the level-greedy tree for 
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the level-degree sequence that has Ly = {d,}, Ly = {do,...,da,41} and for eachi > 1, 


Lil = D7 (@-1) 
deLj_1 


with every entry in L; at most as large as every entry in Lj_. 


Fig. 5.7 shows a greedy tree with degree sequence (4, 4, 4, 3, 3,3, 3,3, 3,3, 2,2,1,...,1). 


Figure 5.7 A greedy tree. 


By definition, every greedy tree is level-greedy. However, Fig. 5.8 shows a level- 


greedy tree that is not greedy. It has level degree sequence: 


1145. 1Oses SP Aeron oy 221s he Hee Tt Tae. 


KN @ 


Figure 5.8 A level-greedy 
tree. 


For a fixed degree sequence, greedy trees minimize the sum of distances (Schmuck, 
Wagner, and Wang 2012; Wang 2008; Zhang, Xiang, Xu, and Pan 2008) and maximize 
the number of subtrees (Andriantiana, Wagner, and Wang 2013; Zhang, Zhang, Gray, 
and Wang 2013). We will show that they also minimize Ecc(7’) among trees with a 
given degree sequence. 

Here we provide some set-up for the proofs of the next two theorems. See Fig. 5.9 


for an illustration. Given a tree T rooted at v, let T; be the subtree, rooted at child 
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v, of v, containing some leaves of height h := h(T). Let h’ := h(T — 7T,). Then for 


any vertex u € V(T’— 7;) and any w € V(7}) with hr(u) = hr(w) = j, then 


eccr(u) =j+h, (5.2) 


eccer(w) = max{j + h’,eccr,(w)} <j +h (5.3) 


where the first is only dependent on the height of T and the second depends only on 


h’ and the structure of T}. 


V1 


w (height 7) u (height 7) 


T-T, 


T; 
‘ (height h’) 


(height h) 


Figure 5.9 A tree rooted at v with daughter subtree, 7}, 
containing leaves of height h. 


The following lemma implies that the level-greedy tree has the minimum total 


eccentricity among all rooted trees with a specified level-degree sequence. 


Lemma 5.15. Let ¢ be a non-negative integer. Among the trees with a given level- 
degree sequence, the level-greedy tree maximizes the number of vertices having eccen- 


tricity at most &. 


Proof. We proceed by induction on the number of vertices. The base case with one 
vertex is trivial. 

Fix > 0. Let T be a rooted tree with the given level-degree sequence and the 
maximum number of vertices with eccentricity at most @. (ie. T is optimal.) For 
vertices w € JT, and u € T — Tj, both of height 7, suppose for contradiction that 


deg(u) > deg(w). Create a new tree T’ by moving deg(u) — deg(w) children of wu and 


107 


their descendants to adoptive parent w. This effectively switches the degrees of u and 
w while maintaining the level degree sequence. 

While eccy:(u) = eccr(u), notice that h’ did not increase and neither did eccr(w) 
for w € V(T}). Since eccy:(w) < max{j +h’, eccr, (w)} = ecer(w), if strict inequality 
holds, then we have contradicted the optimality of T. Otherwise, JT’ and T are both 
optimal trees. In this case, we can repeat this shifting of degrees for pairs of vertices 
of height 1, followed by pairs of vertices of height 2, and so on until we either meet a 
contradiction or construct an optimal tree in which deg(u) < deg(w) for all w € Ty 
and u € T — T, of the same height. Assume that our optimal T has this property. 

Now we have a partition of the level-degree sequence for JT’ into level-degree se- 
quences for T’— T,. By the inductive hypothesis, we may assume that both 7) and 
T — T, are level-greedy trees on their level-degree sequences. As a result, T’ is a 


level-greedy tree. : 


The next theorem also yields a stronger result than merely minimizing total ec- 


centricity among trees with a given degree sequence. 


Theorem 5.16. Fir € € Z2°. Among the trees with a given degree sequence, the 


greedy tree maximizes the number of vertices with eccentricity at most ¢. 


Proof. Let T be a tree with the given degree sequence with the maximum number of 
vertices with eccentricity at most @. (i.e. T is optimal.) Many times we will use the 
following claim: For two vertices u and v with h(u) < @ < h(v), it is preferable to 
assign degrees such that deg(u) > deg(v) in order to maximize the number of vertices 
with height at most 2. 

Find a longest path in T and root T at a center vertex v of that path. In T— {v}, 
let T; be the component with the leaf of greatest height. Let v, be the child of v 
in 7. By our choice of the root, if h is the height of 7), then the height of T — 7; 
has height h’ € {h—1,h}. Now for any w € V(7;) with hr(w) = j, we have 
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eccp,(w) < (j —1)+ (h—1) <j +h! —1. In light of (5.3), 
ecer(w) = max{j +h’, eccr,(w)} =p +h’. 


For w,x € V(T)), if hr(w) < hr(x) then, by our earlier claim, ecer(w) < eccr(z) 
which implies deg(w) > deg(x) in T because T maximizes the number of vertices 
with small eccentricities. 

Vertices in T — T, with height 7 have eccentricity 7 + h by (5.2). So for u,v in 
V(T —T;,), when hr(u) < hr(v), we can conclude deg(u) > deg(v) in T. 

These observations establish the fact that either the root of T — 7; or the root of 
T, has the largest degree in 7’. 

We now examine two cases based upon the value of h’. When h = h’, we have 
ecep(w) = j+h = ecer(u) for any w € V(T;), u € V(T—T}) with hr(w) = hr(u) = 7. 
Therefore, for x,y € V(T), if hr(x) < hr(y), then deg(x) > deg(y) in T. As an 
immediate consequence, the root of T’ has the largest degree. 

When h’ = h — 1, we may assume that the root of T has the largest degree, for 
otherwise, we could reroot JT’ at vj which would not change the vertex eccentricities 
or the difference between h and h’. Continuing in the setting with h’ = h — 1, for 
wéEV(T) and u,y € V(T — 74), if hr(w) = hr(u), then eccr(w) = ecer(u) — 1. So 
deg(w) > deg(u) in T. However, if hp(w) > hr(y) +1, then ecer(w) > ecer(y). So 
we may assume deg(w) < deg(y) in T. 

In both cases, we may assume that vertices of smaller height have larger degrees. 
Consequently, this determines the level degree sequence of JT’. In fact, this is the level 
degree sequence for the greedy tree. The previous lemma asserts that we can assume 


T is level-greedy. Therefore, T is the greedy tree. rT 


Remark 5.17. Extremal trees for total eccentricity are not unique. In Theorem 5.16, 
we proved that the greedy tree had a stronger property. But in the proof, we can see 


that the greedy tree is not even unique in this regard. 
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Greedy trees with different degree sequences 


As a final remark on greedy trees, given a collection of degree sequences, we order 
the corresponding greedy trees by their total eccentricity. The following observations, 
similar to previous works on other indices, yields many extremal results as immediate 


corollaries. For an example of such applications see Zhang, Zhang, Gray, and Wang 


(2013). 
Definition 5.18. Given two non-increasing sequences in R", nm’ = (d},--- ,di,) and 
nm” = (d{,--- ,di), a” is said to majorize 1’, denoted n'< 1", if for k € [n — 1] 


k k n n 
Shy! and Saha Sdt 
i=0 i=0 i=0 


i=0 


Lemma 5.19 (Wei (1982)). Let a’ = (dj,---d} 


') and nm” = (d{,--- ,d”) be two 
non-increasing tree degree sequences. If n' <7", then there exists a series of (non- 


increasing) tree degree sequences 1 = (a, ...,d) for 1<i<m such that 


m—1) m) " 


gw =n) gq7@) gq... a7! an™ = x", 


In addition, each t and x“ differ at exactly two entries, say the j and k entries, 


j<k where dt) =a +1 and dl?) = dP -1. 


Remark 5.20. Lemma 5.19 is a more refined version of the original statement in 
Wei (1982). In this process, each entry stays positive and the degree sequences remain 
non-increasing. Thereby, each obtained sequence is a tree degree sequence that is non- 


increasing without rearrangement. 


Theorem 5.21. Given two tree degree sequences 1’ and x" such that 1’! <7", 
Eee(T,)- > Eco(Ts,) 


where T* is the greedy tree for degree sequence v. 
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Proof. According to Lemma 5.19, it suffices to compare the total eccentricity of two 
greedy trees whose degree sequences differ in two entries, each by exactly 1, ice., 
assume 


nm = ( 19777 d,) I ee ,d,) =n" 


with dj =d +1, dj =d,, — 1 for some j < k and all other entries the same. 
Let u and v be the vertices corresponding to d; and dj, respectively and w be a 
child of v in T*, (Fig. 5.10). Construct T, from T*, by removing the edge vw and 


adding edge ww. Note that T,” has degree sequence 7” and by Theorem 5.16 


Eec(Ti1) < Ecc(Ty). 


T 


viper Ti 


Figure 5.10 Creating 7,” from 7, when 
i (448 8i os 224 yao ad 
Te == (AA AB neal) y 


The height of any vertex in T,” is at most that of its counterpart in T*. An 


argument similar to that used in the proof of Lemma 5.15 shows 
Ecc(T;,) < Ecc(T™,). (5.4) 


Hence Ecc(T*,) < Ecc(Ty) < Ecc(T%,). = 


TT 


Remark 5.22. As in the proof of the extremality of greedy trees, equality holds more 
often in (5.4) compared with its analogue for many other graph invariants. This 
also serves as some indication that Ecc(T) is not as strong of a graph invariant as 


compared to others in terms of characterizing the structures. 


111 


By comparing greedy trees with different degree sequences, the extremality of 
trees with respect to minimizing Ecc(.) under various restrictions easily follows. 
Consider, for example, trees with a given number of vertices and exactly @ leaves. 
The degree sequence of such a tree has exactly @ of 1’s, where the degree sequence 
(¢,2,...,2,1,...,1) majorizes all other possible degree sequences. The corresponding 
greedy tree is a “star-like” tree (a subdivision of star). Similarly, for trees with a given 
number of vertices and maximum degree k, the degree sequence (k,k,...,k,@,1,...1) 
majorizes all other degree sequences with maximum degree k, where @ is the unique 
degree that is possibly between 1 and k. The corresponding greedy tree is called the 
“extended good k-ary” tree. See for instance, Bartlett, Krop, Magnant, Mutiso, and 


Wang (2014) or Zhang, Zhang, Gray, and Wang (2013) for details. 
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CHAPTER 6 


ON DIFFERENT “MIDDLE PARTS” OF A TREE 


There are many different vertex attributes that have been placed on trees. We are 
interested in the eccentricity, distance, and number of subtrees. Each attribute natu- 
rally defines a set as the middle of the tree. For example, the study of the center and 
centroid (Definitions 6.1 and 6.2) can be traced back to Jordan (1869). We explore 
extremal problems regarding the distance between the vertices in the different middle 


parts. 


6.1 DEFINITIONS AND CHARACTERIZATIONS 


Below we define each attribute. The distance in the tree from u to v, denoted d(u, v), 


is the number of edges on the unique connecting path P(u,v). 


Definition 6.1. The eccentricity of a vertex v in a tree T is the largest distance that 


one can travel in T when starting at v. More specifically, 


eccr(v) = Rees d(v, u). 


The center of T, denoted C(T), is the set of vertices which have the minimum eccen- 


tricity among all vertices in the tree. 


Definition 6.2. The distance of a vertex v, denoted d(v), is the sum of distances 


from v to each other vertex in T, 


div) = S° d(v,u). 


ueV (T) 
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The centroid of T, denoted CT(T), is the set of vertices which have the minimum 


distance among all vertices in the tree. 


A subtree of tree JT’ is a connected subgraph which is induced on a set of vertices. 


We consider T' to be a subtree of itself and a single vertex is also a subtree of T’. 


Definition 6.3. As the name suggests, the number of subtrees of a vertex v, denoted 
Fr(v), is the number of subtrees of T which contain v. The subtree core of a tree T, 
denoted by Core(T), is the set of vertices that maximize the function Fr(.) (Székely 
and Wang 2005). 


If H is a forest and v is a vertex in H, then Fy(v) will be defined, as above, to 
be the number of subtrees of H which contain vertex v. In particular, all subtrees 
which are counted must be subtrees of the component of H which contains vertex v. 

Jordan (1869) found that C(T) consists of either one vertex or two adjacent 
vertices (see also Ex. 6.21a in Lovdsz (2007)). Given the vertices along any path 
of a tree, the sequence of F’(.) function values is strictly concave down (Székely 
and Wang (2005)), the sequence of d(.) function values are strictly concave up (Ex. 
6.22 in Lovdsz (2007); Entringer, Jackson, and Snyder (1976)), and the sequence of 
eccr(.) function values are concave up (Ex. 6.21 in Lovdsz (2007)). Strict concavity 
immediately implies that the sets CT(T) and Core(T) either consist of one vertex or 
two adjacent vertices. 

We are specifically interested in how the middle sets are located, relative to one 
another. It is well-known that C(T) and CT(T) can be far apart (Ex. 6.22c in Lovasz 
(2007)), and that Core(T) can differ from them (Székely and Wang 2005). 

There are some natural questions that we will explore. How close to each other 
can they be? How far apart can they be spread? Must they lie on a common path? 


Can they appear in any ordering? 
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It is easy to find trees where C(T), CT(T), and Core(T) coincide, like the star 
and paths of even length to name a few. It is more interesting to see how far apart 
these middle sets can be in a single tree. 

As any two edges always lie on a common path, the vertices from any two of the 
sets C(T), CT(T), and Core(T) always lie on a path. However, it is possible that 
the vertices from C(T), CT(T), and Core(T) in the same tree T do not all lie on a 


common path. Figure 6.1 provides an example of this very situation. 


14 vertices 


~ 11 vertices 
15 vertices 


Figure 6.1 A tree with v € C(T), u€ CT(T), 
w € Core(T) which do not lie on a common path. 


On the other hand, when the vertices of C(T), CT(T), Core(T’) happen to lie on 
the same path, they can appear in any order. Figure 6.2 provides some illustrations. 

Among the examples with different ordering of middle vertices, it is interesting 
to observe that vertices in Core(T) generally have large degree; vertices in C(T) 
generally have small degree; while vertices in CT(T) behave somewhat between the 
previous two. 

Here, we formalize some necessary and sufficient conditions for a vertex to be in 
a middle part. While not novel, these propositions and their proofs are included for 


completeness. 


Proposition 6.4. Let T be a tree with at least two vertices. A vertex v is in the center 
C(T) if and only if there are two leaves, u and w, such that P(v,u) A P(v,w) = {v}, 


d(v,u) = eccr(v), and d(v, w) > eccr(v) — 1. 
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Figure 6.2. An example of trees with vertices 
vé€C(T),u€ CT(T), w € Core(T) ona 
common path, but in different orders. 


Proof. Fix v € C(T). We consider two cases based upon the value of eccr(v) =: x. 
If x = 1, then T is a star with v in the center. In particular, if there are more than 
two vertices, any two leaves will serve the purpose. If T consists of only two vertices, 
they will both be leaves which meet the criteria. 

Otherwise x > 2. By the definition of eccentricity, there is a leaf u such that 
P(v,u) has x edges. Further, for any other leaf w, the length of P(v, w) is at most x. 

Let T’ be the forest obtained from T by deleting the vertices of P(v,u), except v. 
Let H be the connected component of T’ which contains v. Suppose for contradiction 
that, for every w € V(H), dy(v,w) < x — 2. Let v’ be the unique neighbor of v on 
P(v,u). Observe dr(vu',u) = «—-1 and dr(v',w) < dy(v, w) +1 = dr(v,w)+1 < «-1 
for every w € V(H). For any leaf a € L(T \ A), dr(v’,a) +1 = dr(v,a) < x. Thus 
eccr(v’) = « —1. This contradicts the choice of v because eccr(v’) < eccr(v). 
Therefore there is at least one w € L(H) has dy(v,w) € {x — 1, x}. 

On the other hand, if a vertex v has the property that there are leaves u, w with 
disjoint paths P(u,v) and P(v,w) with d(v, u) = eccr(v) and d(v, w) > eccr(v) — 1, 


then the eccentricity of the vertices of P(u, w) is at least that of v. All other vertices 
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have eccentricity at least one more than eccy(v). Therefore v € C(T). = 


Corollary 6.5. If there are two leaves u,w such that d(v,w) = d(v,u) = eccr(v), 
then C(T) = {v}. If no such w exists, then |C(T)| = 2 where the neighbor of v on 


P(u,v) is also in the center. 
Next we give a characterization of the vertices in CT(T). 


Proposition 6.6. Let T be a tree with at least two vertices. A vertex u is in the 


centroid CT(T) if and only if for each neighbor v of u, we have 
Nuy(V) < Nay (u) 


where Nuy(u) (Nuv(v)) denotes the number of vertices in the component containing u 


(v) in T — uv which is the result after the deletion of edge uv from T. 
Proof. For any two neighboring vertices u and v, it is easy to see 
d(u) = d(v) + nuv(v) — Nu (u). (6.1) 


Therefore d(u) < d(v) exactly when nuy(u) > nuv(v). Indeed, along any path uvw, 
Nuv(U) < Nyw(v) because when vw is a removed, the component containing v is 
a superset of the component containing u when the edge wv was removed. (The 
inequality is strict because the first set contains vertex v while the second does not.) 
Similarly, n..(v) > Nyw(w). Whenever d(u) < d(v), the inequalities established here 
imply 
Nyw(W) — Nyw(V) < Nuy(V) — Nuy(u) < 0, 

from which we can conclude d(v) < d(w). Therefore, whenever wu is a vertex such 
that d(u) < d(v) for each of its neighbors v, then wu is in the centroid. The converse of 
this statement holds by the definition of the centroid and making use of equation 6.1 


which implies that d(u) < d(v) if and only if ny,y(v) > nyy(u). : 


Lastly, Proposition 6.7 gives a characterization of Core(T). 
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Proposition 6.7. A vertex u is in Core(T) if and only if for each neighbor v of u, 
we have 


Frew (u) = Pp eaps Coys 


Proof. For any two adjacent vertices u and v, let Ty,,(u) denote the component of 
T — uv which contains u. Each subtree H of T’ which contains v is characterized by 
two subtrees: the intersection of H with T,,,(u) and the intersection of H with T),,(v). 


Note that the intersection with T),,(w) may be empty. Therefore 
Fr(v) i Pipcag(®) (Frwy (u) =e 1) == Fr_w(v) Frwy (u) +t Fpany(@). 


Likewise 

Fe(u) = Frew (u) Frew (v) + Frew (wu). 
Therefore, Fr(u) > Fr(v) exactly when Fr_y,(u) > Fr_u(v). Now if u € Core(T), 
then Fr(u) > Fr(v) for all v € V(T) and consequently Fr_wy(u) > Fr—w(v) for each 
neighbor v of u. This proves one direction of the proposition. 

For any path uvw, notice that each subtree of T — uv which contains u can be 
identified with a subtree of T’— vw which contains v. Just include the edge uv in the 
subtree. Therefore Fy_yy)(u) < Fr_ww(v). (The inequality is strict because {vu} is a 
tree in the second set which will not be identified with any tree in the first collection.) 
A similar argument shows Fy_yw(w) < Fr_w(v). 

Now assuming F'(u) > F(v) which holds exactly when Fr_,,(u) > Fr—w(v), 
we determine 

Prag (Ww) Ppl) SP pil th) PP a) (6.2) 
This is equivalent to F’r(w) < Fr(v). 

Now if every neighbor v of vertex u has the property FPy_y,(u) > Fr—w(v), then 
Fy(u) > Fr(v). We have shown that this extends to the neighbors w of v such that 
Fy(w) < Fr(v) and the argument continues out to every vertex in the graph implying 


that u is in Core(T) since it has the largest number of subtrees of J’ containing it. m 
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6.2 MAXIMUM DISTANCES BETWEEN MIDDLE PARTS IN GENERAL TREES 


Fix an arbitrary n € Z*. Among all trees with n vertices, we determine the maximum 
distance that can be realized between vertices from different middle parts. We will 


see that the maximum distances are achieved by the structure named “comets.” 


Definition 6.8 (Barefoot, Entringer, and Székely (1997)). An r-comet of order n is 


formed by attaching n — r pendant vertices to one end vertex of a path on r vertices 


- e@ -—-----¢* ry 
ee 
r vertices 


Figure 6.3. An r-comet of order n. 


(Figure 6.38). 


For vertex sets S,.S” in a tree T, the quantity min{d(u,v) :u € S,v € S’} will be 
denoted d(S, S’). 


6.2.1 BETWEEN CENTER AND CENTROID 


Theorem 6.9. Fiz an arbitrary n € Z*. For any tree T with n vertices, 


d(C(T),CT(T)) < |" = =| | 


Proof. Fix a tree T on n vertices. Let v € C(T) and u € CT(T) such that the graph 
distance between u and v is precisely d(C(T),CT(T)). Therefore, no vertex on the 
path P(u,v) other than wu and v is in the center or the centroid of T. 

Let P(u,v) denote the path connecting u and v and let T,, denote the component 


containing u in T — E(P(u,v)). By Proposition 6.6, 
[V(Tu)| > 2 — |V(Lu)]- 
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This implies 
n 
ay =, 
Vir >% 


Let w be a leaf such that P(v,w) and P(u,v) are disjoint, except for v, and the 
length of P(v,w) is maximum. Because v € C(T) and the neighbor of v on P(u, v) 


is not in C(T), Proposition 6.4 tells 
d(v,w) = eccr(v). 
Since u is not a leaf, it is easy to see that 


d(u,v) < ecer(v) — 1. 


Jw C= ey 


Figure 6.4 A tree T with u € CT(T), 
v€C(T), w € L(T), and all vertices 
not in T,, are on the path P(u,w). 


Therefore, we have 


5 > n- (VI (6.4) 
> d(u,v)+d(v,w) (6.5) 
> 2Qd(u,v) +1. (6.6) 
This implies 
R=2 
d(u,v) < t 


In particular, ifn = 4k +r with r € {0,1,2}, then 


1. 4k4+r-2 
p= = oe 
2 4 = 


n-2 when n = r mod 4 for r € {0,1,2}, d(u,v) < k —1 where 


Since d(u,v) < 3, 


k= [8]. 


F220 


When n = 4k + 8, 


As a result, d(u,v) < k. = 


Proposition 6.10. Let k := Kae Equality holds in (6.3) exactly when n,T fall into 


one of the following categories: 
e n= 4k and T is the 2k-comet. 
en=4k+1 orn=4k+2 and T is one of the following trees: 


— 2k-comet 


— 2k-comet on n—1 vertices together with one vertex pendant to one of the 


internal vertices of P(u,w) 


— P(u,w) has 2k+1 vertices and T,, is a tree which is rooted at u, has height 


at most 2, and has n — 2k — 1 non-root vertices. 
e n=4k+3 andT is a (2k + 2)-comet. 


Proof. If n is even, then = > n — |V(T,)| is equivalent to $ —1 > n— |V(Zy)I. 


Therefore 


5 ~1>n-—|V(T,)| > d(u,v) + d(v,w) > 2d(u,v) +1 


d(u,v) <—-1 


2 
4 
When n = 4k, the trees with d(u,v) = k — 1 are precisely those trees in which all 
of the inequalities above are equalities. In other words, T,, (which includes vertex u) 
has 2k + 1 vertices, all vertices not in T,, lie on the path P(u,w) and all vertices not 
on P(u,w) are pendant to u. This characterizes the 2k-comet. 


If n = 4k +2, then the above inequalities indicate d(u,v) < k—}. Because d(u, v) 


is an integer, d(u,v) <k—1. To analyze the extremal structures, first observe that 


Lt 


5 — 1 is even while 2d(u,v) +1 is odd. Therefore not all inequalities can be tight. 
Indeed, exactly one will be strict. If the first inequality is not strict, then we have 
a 2k-comet on n vertices. If the second inequality is the one which is not strict, we 
have a (2k+1)-comet on n—1 vertices with one vertex pendant to one of the internal 
vertices on the path P(u, w). If the last inequality is not strict, we have a tree rooted 
at u on 2k + 1 non-root vertices and has height at most 2, together with a disjoint 
path P(u, w) with 2k + 1 vertices. 


When n is odd, we have 


—1 
"> n—|V(T,)| > d(u,v) +d(v,w) > 2d(u,v) +1 


n—3 
< ——_. 


When n = 4k + 3, then d(u,v) = k will hold precisely when all inequalities are 
tight. This happens when T is a (2k + 2)-comet. 

When n = 4k +1, uot is even while 2d(u,v) + 1 is odd. Therefore exactly one of 
the inequalities must not be strict. If the first inequality is not strict, then we have 
a (2k + 1)-comet. If the second inequality is not strict, then we have a 2k-comet on 
n — 1 vertices with one vertex pendant to one of the internal vertices of P(u, w). If 
the last inequality is not strict, then JT), is a tree which is rooted a u, has height at 
most, and has 2k non-root vertices will the rest of the tree is just the path P(u, w) 


with 2k + 1 vertices. r 


6.2.2 BETWEEN CENTROID AND SUBTREE CORE 


Next we turn our attention to the centroid and the subtree core. 


Theorem 6.11. Let T be a tree with n > 8 vertices. Ifn > 2!82"I-1 + flog, n], then 


n—-1 


a(CT(L),Core(T)) < | | = Logs nj —1 
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with equality for the (n — |log,n| —1) - comet. Otherwise 


n—-1 


d(CT(T), Core(T)) < | mai e 


with equality holding for the (n — |log,n]|)-comet. 


Proof. Let u € CT(T) and v € Core(T) in a tree T with |V(T)| = n and the graph 
distance between u and v is precisely d(CT(T), Core(T)). Let P(u,v) denote the path 
connecting u and v and let T,,, T,, denote the components containing u, v respectively 
in T— E(P(u,v)). Set x := |V(Z,)| and y := |V(Z,)|. Ultimately, we desire an upper 


bound for d(u,v) together with an extremal example. Observe 
d(u,v) <<n-ax—ytl. 


Thus we desire lower bounds for x and y. 
Since u € CT(T) and the neighbor of u on P(u, v) is not in CT(T), Proposition 6.6 


implies 


More precisely, x > [2]. 
Next we bound y. Because v € Core(T) and the neighbor of v on P(u,v) is not 


in Core(T), Proposition 6.7 gives 
Fr,(v) > Fr_z,(w) 


where w is the unique neighbor of v on P(u,v). See Figure 6.5 for an illustration of 
how these pieces interact. 
Further note that every subtree in 7;, which contains v can be uniquely identified 


by the set of its vertices, excluding v. Thus, 
Fr, (v) < ore 
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A ZN 


Figure 6.5 A representation of tree T for the proof of 
Theorem 6.11 with path P(u,v), Ty, Ty, and w labeled. 


Note that equality holds if and only if every subset of vertices induces a tree which 
is the case exactly when T, is a star centered at v. On the other hand, since T' — T, 
is a tree, subtrees can be created from T' — T, by iteratively deleting a leaf, which is 


not v, of T — T,. The result is 
Fr_7p,(w) >n—y. 


Equality holds here if and only if T’— T, is a path with w as an end vertex. 


Putting these inequalities together, we see that when v € Core(T), 
2s ny. 


As a linear and an exponential equation in y, if 2%~' = n — yo, then for all y > yo, 


2¥-! > n — y. Therefore, we proceed by solving 2¥~!' = n — yo. Note that yo > 0. 


2b = n—yp 
yo-1l = logs(n =a Yo) 
Yo = logs(n—yo) +1 


/\ 


logo(n) + 1. 
Using the equation yo = log,(n — yo) + 1 and substituting into itself, we find 


yo = logs(n —logy(n — yo) -1) +1 


IV 


log,(n — log,(n) — 1) +1 


> log,(n). 
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The last inequality will be proven later for n > 8. 


As a result, we have the bounds 
logy(n) < yo < logy(n) + 1. 


Further, if yo < |log,n| +1, then 2¥-—1>n-—y precisely when y > |log,n| + 1. 


However, if yo > [log,n| +1 then 2¥-—1>n-—y precisely when y > |log,n| + 2. 
Note that these are the two condition in the theorem statement. 

Now we show logs(n—log, n—1) > logy n—1 for n > 8. First observe logyn < $—1 
for n > 8. Therefore 


nr 
l < -—l 
O£9 nN 5 


5 < n-—logsn-1 


log, (=) < log, (n — log, n — 1) 


logsn—1 < logy(n—logyn—1). 


When n > 8, our bounds for integers x and y give 


As mentioned earlier, this can be strengthen to d(u,v) < [S| — |log.n| — 1 if 
yo > |logyn| +1. However, this will only happen if 2!!°%"J < n— |logyn|— 1 as 
stated in the theorem. 

As for extremal trees, equality will hold in the upper bound for d(u,v) if T,, has 
exactly =| vertices and T), is a star while T — T,, is a path. This describes the 


C-comet where C = n — |log,n| or in the case where n > 282"! + flog,n] +1, 


C=n- |log,n|—1. = 
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6.2.3 BETWEEN SUBTREE CORE AND CENTER 

Theorem 6.12. For any tree T on n> 8 vertices, ifn > 2!82"l-! + flog, n] then 
a(C(L),Core(T)) < | 5(n— [los n| — 2)| 

which is tight for the K-comet with K =n — |logyn| +1. Otherwise 
a(C(T),Core(T)) < | 5(n— [loge n| - 1)| 

which is tight for the K-comet with K =n -— |log,n]. 


Proof. Let u € Core(T) and v € C(T) in a tree T with |V(T)| = n and the graph 
distance between u and v is precisely d(C(T), Core(T)). Use T,, (respectively T),) to 
denote the component containing u (v) in T — E(P(u, v)) and let y = |V(T,)]. 

Because v € C(T) and the neighbor of v on P(u,v) is not in C(T), there is a leaf 
w in T, with d(v, w) = eccr(v). As argued in the proof of Theorem 6.9, 


d(u,v) < ecer(v) —1 < d(v,w), 


2d(u,v) +1 <d(u,v)+d(v,w) <n-y. 


Note that these inequalities are tight for the (n — y + 1)-comet. 


Because u € Core(T’), we can conclude, as in the proof of Theorem 6.11, 
WY 1>n-y. 


Consequently, 
y > [log n|. 


Combining inequalities, we obtain the bound in the theorem statement: 


1 1 

d(u,v) < AG —y- 1)| < AG — |log,n| — 1) 

Recall from Theorem 6.11 that if n > 2!82"!-1 + flog, n], then 
y > |loggn|+1 
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and consequently we obtain the better bound 


a(u,v) <|5(»—y-1)| < |5(n— Logan] - 2). 


6.3 TREES WITH DEGREE RESTRICTIONS 


In this section, we narrow our sights to classes of trees that follow certain degree 
restrictions. First we fix a degree sequence and see some results for trees which 
realize this degree sequence. Next, we consider only binary trees in which all non- 
root vertices have degree 1 or 3 while the root may have degree 2 or 3. Lastly, we fix 
integers n,k and consider classes of tree with n vertices and maximum degree k. 
For trees with a maximum degree condition, we obtain results about the distance 
between their “middle parts.” In order to prove these, we first obtain results about 
the maximum or minimum number of root-containing subtrees a tree with a specified 
degree sequences can have. Note that among trees with n vertices, it is the path, 
rooted at one end, which minimizes the number of root-containing subtrees and the 
star, rooted at the center vertex, which maximizes the number of root-containing 


subtrees. 


6.3.1 ‘TREES WITH A GIVEN DEGREE SEQUENCE 


In Chapter 5, Definition 5.14, we defined the greedy tree, an extremal structure 
explored in many previous studies. 

Fix a degree sequence for a tree and distinguish a single value in this sequence 
which will be the degree of the root. Similar to the greedy tree, we define the rooted 


greedy tree. 


Definition 6.13. Let d = (d,,d2,...,dn) be a tree degree sequence in non-increasing 


order with degree d; identified as the root degree. Let (d\,d,...,d\,_,) be the degree 


n-1 
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sequence d in non-increasing order with d; removed. The rooted greedy tree for this 
degree sequence is the level-greedy tree for the level-degree sequence that has Lo = {dj}, 
Ly = {d),...,dy,} and for each j > 1, 
IE;;J= Do (d@-1) 
d€Lj_-1 
with the largest element in L; less than or equal to the smallest element in Lj_1. 


Figure 6.6 shows a rooted greedy tree with root degree 2 and degree sequence 


(CR eee ee Ps (6.7) 


ANS 


Figure 6.6 <A rooted greedy tree with 
degree sequence (6.7) and root degree 2. 


Among trees with given degree sequence, greedy trees are extremal with respect 
to many graph invariants. For example, the following result is for root-containing 


subtrees. 


Theorem 6.14 (Andriantiana, Wagner, and Wang (2013)). Fix a degree sequence d 
and a positive integer k. Among rooted trees with degree sequence d, the number of 
subtrees which contain the root and have exactly k vertices is maximized by the greedy 
tree. Consequently the greedy tree maximizes the total number of root-containing 
subtrees. 

Fix a degree sequence, distinguish one value in the sequence as the root degree, 


and fix a positive integer k'. Among rooted trees with this degree sequence and the 
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specified root degree, the number of subtrees containing the root and having k’ vertices 
is maximized by the rooted greedy tree. Consequently, the rooted greedy tree maximizes 


the total number of root-containing subtrees. 


6.3.2. BINARY TREES 


The study of binary trees is well motivated from its applications in phylogeny. Székely 
and Wang (2005) studied the number of subtrees of a binary tree. They found that 
the extremal structures are good trees, rgood trees, and caterpillars. In our terms, a 


good binary tree is a greedy tree with root degree 3 and degree sequence 
(Bi con Cnt occ 


and an rgood binary tree is a rooted greedy tree with root degree 2 and degree 
sequence 


ig: ee 2 enh 


A binary caterpillar consists of a path P with pendant vertices that make the degree 
of each internal vertex 3. 


Their results for the number of subtrees are as follows: 


Theorem 6.15 (Székely and Wang (2007)). Among all binary trees with n leaves, 
where every non-leaf vertex has degree 3, the good binary tree minimizes the number 


of subtrees. 


Theorem 6.16 (Székely and Wang (2005)). Among binary trees with n leaves, the 


binary caterpillar on n leaves minimizes the number of subtrees. 


As an immediate consequence of Theorem 6.14, we obtain the following results 


for root-containing subtrees. 


Corollary 6.17. Among all binary trees, the good binary tree has the maximum 


number of root-containing subtrees. 
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Corollary 6.18. Fir n € Z=°. Among all rooted binary trees with n vertices, the 
rgood binary tree is the unique tree which maximizes the number of root-containing 


subtrees. 


For binary trees, we can examine the distance between vertices of different middle 
parts in much the same way that we did in Section 6.2. While the exact calculations 


are quite messy, we conjecture the following result. 


Conjecture 6.19. Among binary trees of order n, the tree T, formed from identifying 
the root of an rgood binary tree with a vertex of maximum eccentricity in a binary 


caterpillar (Figure 6.7), maximizes the distance between 
1. the closest pair u € CT(T) andv € C(T), 
2. the closest pair u € Core(T) andv € CT(T), 


3. the closest pair u € Core(LT) andv € C(T). 


al] [17 


Figure 6.7 An extremal binary tree which is conjectured 
to maximize the distances d(CT(T), C(T)), 
d(Core(T),CT(T)), and d(Core(T), C(T)) for u and v as 
in Conjecture 6.19. The tree T;, is an rgood binary tree. 


6.3.3. TREES WITH BOUNDED MAXIMUM DEGREE 


In this section, we turn our focus to trees on n vertices, all of which have degree at 
most k. 
We previously defined good binary trees and rgood binary trees. In general, for 


each positive integer k, a good tree is a greedy tree with degree sequence 


while the rgood trees are rooted greedy trees with root degree k — 1 and degree 
sequence 
(GRR To 1): 
For any fixed k, these trees only exist for certain values of n. Therefore, we extend 
their definitions as follows so that we can create similar trees for any n > k. 
For positive integers n,k (n > k), a tree with order n and maximum degree k is 


called an extended good tree if it is a greedy tree with degree sequence 
Crees aOR sce 11) 


for some 1 < s < k (Figure 6.8). Notice that the degree sequence is determined by n 
and k. The value s is the remainder when we divide n — 1 by k. If n —1 = qk +58 


then there will be q vertices of degree k, one of degree s, and the rest will be leaves. 


Figure 6.8 An extended good tree with 33 vertices and 
maximum degree 4. 


Similarly, for positive integers n,k, the extended rgood tree with order n and 


maximum degree k, is a rooted greedy tree with root degree k —1 and degree sequence 
(k,k,...,k,&—1,8,1,...,1) 


for some 1 < s < k (Figure 6.9). The value of s will be the remainder when dividing 
n by k. 

Among all rooted trees with n vertices, maximum degree k, and root degree 
p <k-—1, we seek the one with the maximum number of root-containing subtrees. 


We call such a tree optimal. 
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Figure 6.9 An extended rgood tree with 29 vertices and 
maximum degree 4. 


Theorem 6.20. Among all rooted trees with n vertices, maximum degree k, and root 
degree p < k —1, the extended rgood tree maximizes the number of root-containing 


subtrees. 
To provide a proof for this theorem, we first establish two lemmas. 
Lemma 6.21. An optimal tree with n > k must have root degree k — 1. 


Proof. For contradiction, suppose JT’ is an optimal tree with root r having degree 
p<k-—2. Since n > k, there exists a child u of r that is not a leaf. Let v be a child 
of u and T,, be the subtree induced by v and its descendants. 

Define T’ := T—{uv}+ {rv}. Every root-containing subtree in T can be uniquely 
identified by its list of vertices. It is easy to see that each list forms a root-containing 
subtree in J’. However, T’ also has root-containing subtrees which contain v and not 
u. These do not appear in J’. Therefore J’ has more root-containing subtrees than 


T. This contradicts our choice of T. r 


Definition 6.22. For degree sequences m = (do,--- ,dn—1) and m’ = (dp,--- ,di,_4), 


r’n-1 


nm’ majorizes 7, denoted 1 <7’, if for each k € {0,--- ,n — 2}, 


k k n—-1 n-1 
i=0 i=0 i=0 i=0 


The following is a simpler analogue of Theorem 11 of Andriantiana, Wagner, and 


Wang (2013). We skip the routine argument. 


132 


Lemma 6.23. Let T and T" be rooted greedy trees on n vertices with root degree k—1. 
If T has degree sequence x and T" has degree sequence x’ where 7 <7’, then T’ has 


more root-containing subtrees than T. 


In the search for an optimal tree, Lemma 6.21 implies that it is sufficient to restrict 
our attention to trees with root degree k — 1. Because we are considering only degree 
sequences on n vertices with maximum degree k, it is easy to see that the degree 
sequence of the extended rgood tree majorizes all other such degree sequences. Thus, 
Lemma 6.23 then implies that the extended rgood tree for order n and maximum 


degree k as stated in Theorem 6.20. 


Remark 6.24. Note that, among all rooted trees of given order, root degree at most 


k —1, and maximum degree k: 
e the extended rgood tree minimizes the height; 


e the path (rooted at one end) minimizes the number of root-containing subtrees 


and maximizes the height. 


6.3.4 IN TREES WITH A GIVEN MAXIMUM DEGREE k 


Fix n,k € Z*°. Similar to the binary tree case, we restrict our attention to classes 
of trees which have order n and maximum degree k. Then we look for trees in this 
class which maximize the distance between different “middle parts.” Our findings are 


detailed in this section. 


Theorem 6.25. For fixed n,k € Z*°, any tree T with order n and maximum degree 


k has 
n- [2 | —h, 


2 


i) 


where 


This inequality is tight for the tree formed from an extended rgood tree and a path by 


identifying the root of the extended rgood tree with one end of a path. 


Proof. Select u € CT(T) and v € C(T) such that d(u,v) = d(CT(T), C(T)). Let T, 
and T,, name the components containing u and v respectively in T — E(P(u, v)). 


Counting the vertices in 7’, we obtain the inequality 
d(u,v) <n—|V(Ty)| — |V(To)| +1. (6.8) 


Because u € CT(T), Proposition 6.6 implies the following for the vertex in the 
centroid which is closer to the center vertices: 
|V(Tu)| > rn — |V (Tul, 


es] 
5 . 


Vir) | 


Set h, and h, equal to the heights of T,, and T, respectively. Because v € C(T), 


Proposition 6.4 implies the following: 
d(u,v) thy < hy, 
d(u,v) <hy — hy < |V(Z,)| — 1 — Aa. (6.9) 


The upper bound for d(u,v) is tight when h, = |V(T,,)| — 1, which happens exactly 
when T, is a path, and h, is minimum. 

By Remark 6.24, the minimum h,, is achieved when T), is the extended rgood 
tree. Since |V(T,)| > [| and the maximum degree is k, we can determine the 
height of an extended rgood tree with these conditions. The extended rgood tree 
with maximum degree h and height h has at most 37?_,(k — 1)' vertices. For T,, with 


n vertices, the height h will be the smallest value which satisfies 


VFS (b= 1) 


—_1-(k-1)" 
~ 1=(k=1) 

— (K-1)"t-1 
= =) 
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Next we solve for h. 


|V(T,)|(k —2) < (k-14-1 


In (\V(T,)|(k — 2) +1) < (h4+1) In(k—1) 


In (JV (Zu) I(k = 2) + 1) 
In(k — 1) 


—-1<h. 


Since h is the smallest value that satisfies the above inequality and |V(T.,)| = [=], 


we can conclude 
In ([#$4](k — 2) +1) 
In(k — 1) 


Without knowing |V (T,)| exactly, we can add (6.8) and (6.9) and solve for d(u, v) 


to obtain the desired upper bound for d(u, v): 


2d(u,v) <n—|V(Ty)| — he 


d(u,v) < 5 (n—|V(Tu)| — Pu) 


< a ]-*) 


IA 
| 
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Theorem 6.26. For fired n,k € Z*°, any tree T with order n and maximum degree 


k has 
n+l 


d(Core(T),CT(T)) <n—n' — oe 


where n’! is the minimum order of an extended rgood tree T,, with maximum degree k 
such that Fy,(u) >n—|V(TL,)|. This inequality is tight for the tree formed from an 
extended rgood tree and a path by identifying the root of the extended rgood tree with 


one end of a path. 
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Proof. Let u € Core(T) and v € CT(T) such that d(u, v) = d(Core(T), CT(T)). De- 
fine T,, and T, to be the components of T— E(P(u, v)) containing u and v respectively. 
Let h, and h, be the heights of T,, and T,, respectively. 

By Proposition 6.6, v € CT(T) and its neighbor on P(u,v) is not in the centroid 


precisely when 


aad 
5 - 


Vie) | 


By Proposition 6.7, u € Core(T) and its neighbor w on P(u,v) is not in the 


subtree core precisely when 


Fr, (u) > 1+ Fr_z,(w) > d(u,v) + Fr,(v) > d(u, v) + |V(T,)I, 


d(u,v) < Fr,(u) — Fr,(v) < Fr,(u) — |V(Z,)|- (6.10) 


The last inequality is tight if T,, is a path. 


Counting the vertices in 7’, we see 


n> du,v) +|V(Tu)| + |V(Zo)] — 1, 


1 
(u,v) Sn =|V(Ty)| = [V(Ta) +1 n= |] nl +1, 


where n’ is the minimum number of vertices in a tree T,, with maximum degree k 
such that Fr,(u) > d(u,v) + |V(Z)| = n—|V(Z,)| as in (6.10). Note that Fr, (w) 
is maximized by the extended rgood tree, giving the extremal tree in the theorem 


statement. a 


Theorem 6.27. For fired n,k € Z*°, any tree T with order n and maximum degree 
k; has 
1 
d(Core(L),C(L)) <n! ~|5(n—n! +] 


where h’ = S| —landn’ is the minimum number of vertices in the extended 


rgood tree T,, with maximum degree k such that F,(u) >n—|V(L,)|. This inequality 
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is tight for the tree formed from an extended rgood tree and a path by identifying the 


root of the extended rgood tree with one end of a path. 


Proof. Let u € Core(T’) and v € C(T) such that d(u, v) = d(Core(T), C(T)). Define 
T,, and T,, to be the components of T — E(P(u,v)) containing u and v respectively. 
Let h, and h, be the heights of T,, and T,, respectively. 

Because u € Core(T) and its neighbor on P(u,v) is not in the subtree core, as in 


(6.10), Proposition 6.7 gives 
d(u,v) < Fr,(u) — |V(T,)| (6.11) 


which is tight when JT, is a path. 
Because v € C(T) and its neighbor on P(u,v) is not in the center, as in the proof 


of Theorem 6.25, Proposition 6.4 gives 
d(u,v) <hy —hy < |V(LQ,)| — hu — 1. 
As in (6.9), this is also tight when J, is a path. 
Adding these two inequalities together we obtain the following bound. 
1 
d(u,v) < . (Fr, (u) — hy — 1). 


The upper bound is maximum when Fy, (u) large and h,, is small which is optimized 
when T;, is the extended rgood tree. 
If n’ is the number of vertices in T),, then because v € C(T) and T, is a path, then 


eccr(v) is at least half of the diameter of T which translates to 


Any tree on n’ vertices with maximum degree at most k will have height at least 
the height of the corresponding extended rgood tree. As determined in the proof of 
Theorem 6.25, 


In conclusion, 
1 
sud) ena Vaieian = AG — nl + h') . 


Further, this upper bound is maximized when n’ is minimized. However, n’ must still 


satisfying the condition F,(u) > d(u,v) + |V(T,)| =n —|V(T.)| from (6.11). = 


6.4. DIFFERENT “MIDDLE PARTS” IN TREES WITH A GIVEN DIAMETER D 


In this section, for fixed n, D € Z>°, consider classes of trees with n vertices and di- 
ameter at most D. The next two propositions follow from exactly the same arguments 


as those of Section 6.2, we skip the proofs. 


Proposition 6.28. For fixed D and large n, every tree T of order n and diameter at 
most D satisfies 
D- | 


a(C(T), CT(T)) < |= 


which is achieved by a D-comet. 


Proposition 6.29. For fixed D and large n, every tree T of order n and diameter at 


most D satisfies 


which is achieved by a D-comet. 


The argument for d(CT(T), Core(T)) is more complex. Fix D and n. Among all 
trees with diameter at most D and order n, fix a tree T’ which realizes the maximum 
value for d(CT(T), Core(T)). 

Select vertices u € Core(T) and v € CT(T) such that the graph distance between 
uand v is precisely d(CT(T), Core(T)). In T—E(P(u, v)), let T,, name the component 
containing u while T,, is the component containing v. Consider u to be the root of T, 


and v to be the root of 7). 
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Let w be the neighbor of u on P(u,v). Because u € Core(T), Proposition 6.7 

implies 
Fr,(u) < Fr_r,(w). 

Because v € CT(T) and its neighbor on P(u,v) is not in CT(T), Proposition 6.6 

implies 
|V(T.)| > n — |V(Z)I- 

Suppose 7), is not a star. Create a new tree JT’ from T by replacing 7, with a star 
T” which is rooted at u and has the same order as T,. Using the convention that 7’ 
and T” are the components containing u and v respectively in T’ — E(P(u,v)), we 


see that 7’ is the same tree as T,. First observe that 
Fr: (u) > Fr,(u) > Fr_r,(w) = Fra (w) 
which implies w ¢ Core(TZ") by Proposition 6.7. Further, 
IVT) = |V(Z.)| > n — |V(Z)| = n — |VT)| 


which implies the neighbor of v on P(u,v) is not in the centroid of T’ by Proposi- 


tion 6.6. Therefore 
d(Core(T’), CT(T")) > dr-(u,v) = dr(u, v) = d(Core(T), CT(T)). 


By the choice of T’, d(Core(T"), CT(T’)) = d(Core(T), CT(T)). So T” is a tree with 
diameter at most D and order n which maximizes d(Core(T), CT(T)). 

Now consider the structure of 7’ in T’. Say T! has x vertices and height h. 
Suppose 7” does not minimize the number of subtrees containing v for its height and 
order. Let T”” be a tree rooted at v with height at most h and order x which minimizes 
Fru (vu). Define T” to be the tree created from T"’ by replacing TY with 7%’. Observe 
that 
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which implies w ¢ Core(T”) by Proposition 6.7. Further, for 7” being the component 
of T” — E(P(u,v)) which contains v, 


VT) = VT) > vn — |V (Th) = 2 — |V (Ty). 


This implies, by Proposition 6.6, that the neighbor of v on P(u,v) in T” is not in 
CT(T") and 


d(Core(T"), CT(T")) > drn(u, v) = dr(u, v) = d(Core(T), CT(T)). 


However, TJ’ was chosen as a tree which maximizes the distance between the sub- 
tree core and the centroid. Therefore d(Core(T”),CT(T”)) = d(Core(T), CT(T)). 
Therefore, T” is also a tree with diameter at most D and order n that maximizes the 


distance between the subtree core and the centroid. 


Remark 6.30. Fix n,D € Z?°. Among all trees with diameter at most D and 
order n, there is a tree T, with T,, a star rooted at u and Ty, a tree which mini- 
mizes the number of subtrees containing v for its height and order, which maximizes 


d(Core(T),CT(T)). This structure T is drawn in Figure 6.4. 


In Section 6.5, we take a closer look at the structure of T,,, a tree which minimizes 
the number of subtrees containing v for its height and order. While we determine 
many necessary properties of T,,, characterizing the exact structure is still an open 


problem. 


Figure 6.10 The structure of a tree T’ with diameter D 
and order n which maximize d(Core(T),CT(T)). Here, 

u € Core(T), v € CT(T), and T, minimizes the number of 
subtrees containing v for its order and height. 
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6.5 ROOTED TREES OF GIVEN ORDER AND HEIGHT 


For any n,h € Z*°, this section is devoted to characterizing the rooted trees with n 
vertices and height at most h which have the minimum number of root-containing 
subtrees. We call these trees optimal. 

To standardize some notation, we restrict our attention to trees TJ’ which are 
rooted at root p, have order n and height h unless mentioned otherwise. Note that 
h(T) = eccr(r). The degree of a vertex v will be denoted deg(v). It is necessary to 
have h+ 1 <n to guarantee that the tree will be realizable. 

For any v € V(T), let T(v) denote the subtree induced by v and all of its de- 
scendants. We will view T(v) as a tree rooted at v. For each neighbor v; of p, set 
T; := T(v;). Here we present several observations regarding the characteristics of an 


optimal tree. 


Lemma 6.31. In any optimal tree T, for any v € V(T), T(v) minimizes the number 
of root-containing subtrees among all rooted trees of the same order and height at 


most h — hr(v). 


Proof. Let T be an optimal tree. Suppose, for contradiction, that there is a vertex 
v for which T(v) does not satisfy the lemma. In other words, there is a tree T’(v), 


which is rooted at v, has the same order as T(v), and has 
h(T’(v)) <h—hr(v) and Fry) (v) < Frey (v). 


Let T” be the tree obtained from T by replacing T(v) with T’(v). Then T and T” have 
the same number of subtrees containing p but not v. Define T* := T — (T(v) — {v}) 


and let p and v. Because T and T” only differ in the descendants of v, we have 
Fr(p) — Fr(p) = Fre (v) Fr-(p, 0) — Friwy)(v) Fre (p, v) > 0, 


a contradiction to the optimality of 7. : 


141 


Lemma 6.32. The height of any leaf in an optimal tree is h. 


Proof. lfn = h+1, it is straightforward to see that the path rooted at one end is the 
optimal tree. In the case when n > h+ 1, some vertex must have at least 2 children. 
Suppose, for contradiction, that there is a leaf v € V(T’) whose height is less than h. 
Let «x be the closest ancestor (possibly the root) of v that has at least two children. 


Let y be a child of x that is not on P(x,v) and z be the child of x on P(z, v). 


. vU 


Figure 6.11 Trees T(x) and T"(x) from Lemma 6.32 


Let T;, be the component containing x in T — xy — xz and consider the tree 
T'(v) = T(x) -—az+yz 


depicted in Figure 6.11. Note that T’(a) has the same order as T'(a) and has height 
no more than h — hr(x) because the height of v in T is less than h. 
Counting the number of subtrees containing x in each tree, we obtain the following 


equalities: 


Fre)(x) = Fr,(x)(1+dray(x, v))(1 + Fray (y)), 


Fpra)(t) = Fp,(x) [1+ (1+ dra(#,v))Fray(y)] - 
Together, these imply 
Fr(a)(&) — Frv(a)(t) = dr(e)(@, 0) Fr, (x) > 0. 


Since J’ was optimal, this contradicts Lemma 6.31. : 
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Lemma 6.33. Every optimal tree has one of the following two properties: 
e All non-root vertices have degree at most 8. 


e All non-root vertices of height less than h — 1 have degree at most 3. For any 
vertex v of height h—1, deg(v) < 4. Further, if deg(v) = 4, then the parent of 


uv must have degree 2 or be the root. 


Proof. As before, this proof proceeds by contradiction. Let x be a non-root vertex in 
an optimal tree T’ with degree at least 4. Say y, z, and w are three children of 2 and 
let wu be the parent of x. Denote by 7), and 7), the components containing u and x 


respectively in T — ua — xy — xz — xw. Without loss of generality, assume 


Frw)(w) = max{Frey(y), Fr)(z), Frwy(w)}. 


Figure 6.12 Trees T(u) and T’(u) in the proof of Lemma 6.33. 


Now consider the tree T’(u) obtained from T(u) by removing the edges xz and 
xw, inserting a path of length 2 between u and w, while identifying the vertices y and 
z (Figure 6.12). Note that 7’(u) has the same height and order as T(u). Counting 


the number of subtrees containing u in each, we find 


Fre(u) = Pr, (u) [1+ Fr,(a) (1+ Fryy(y)) (1+ Frey(2)) (1+ Frew (w))] 


Friyy(u) = Fr,(u) (2 + Frvw)(w)) [1 + Fp, (x) (1 + Fry) (y) Fria(z))| ; 
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Together, these imply the following 


Fr (u) — Fry (u) = Fr, (u) [Fro(0)Frey(y) (Frew) — Fre(2)) 
+ Fr, (2) (Frwy) (y) — 1) 
+ (Frw)(w) +1) (Fr, (2) Frey(z) — 1)] 


> 0. (6.12) 


Because T is an optimal tree, Tw) is an optimal tree by Lemma 6.31. Therefore 
(6.12) must be equality. Note that for any tree H and vertex a € V(H), Fu(v) > 1 
because the subtree containing only the vertex v will be counted. Therefore, equality 
holds in (6.12) exactly when Fp,(z) = Friyy(y) = Fryy(z) = Frwy(w) = 1, or 
equivalently, deg(x) = 4 and y, z, w are all leaves so x has height h—1 in T. Create T’ 
from T by replacing T(u) with T’(u). Because (6.12) is equality, Fr(u)(u) = Fruy(u)- 
Therefore T’ is also an optimal tree. 

In T", degr (x) = 3 but degr(u) = degr(u) + 1. Observe u has height h — 2 in 
T’. If wis not the root of T’ and degy:(u) > 4, then we can repeat the argument for 
optimal tree T’ and vertex u having degree at least 4. Because the height of u is h—2, 
we will find a contradiction in the step which parallels (6.12). Therefore degr(u) < 3 
which implies degr(u) < 2. Since u is not the root of T, we can conclude degr(u) = 2 


as stated in the theorem. r 


In the proof of Lemma 6.33, in the case where degr(x) = 4, we created another 
optimal tree T’ where degr (x) = 3 and no other degree 4 vertices where created. 
Hence, if an optimal tree has multiple degree 4 vertices of height h — 1, we can repeat 
this procedure to obtain an optimal T” with all vertices of degree at most 3. This 


proves the following observation. 


Observation 6.34. There is an optimal tree in which all non-root vertices have 


degree at most 8. 
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We now shift our attention to the structures of T; for 1 <i<k. 


Lemma 6.35. In an optimal tree T, each subtree T; U {p} falls into one of the 


following three categories: 
e There is at most one non-root vertex with degree 3. 


e All non-root vertices of height at most h — 3 have degree 2, the vertex of height 


h —2 has degree 3, and exactly one of its children has degree 3. 


e All non-root vertices of height at most h — 2 have degree 2 and the vertex of 


height h — 1 has degree 4. 


Proof. We prove this in two pieces, considering the alternatives from Lemma 6.33 
separately. We start with the optimal trees in which all vertices have degree at most 
3: 

For contradiction, suppose there exists a T; U {p} with at least two non-root 
vertices of degree 3. Let v be a degree 3 vertex of greatest height in 7; and let u,w 
be the two children of v. Let z be the closest ancestor of v such that degr,(z) = 3, z 
has parent x, and z has child y ¢ V(P(z,v)). Let ¢; denote the distance from v toa 
leaf in T; and 2 the length of P(v, z). Let T,, denote the component containing x in 
T(x) — xz (Figure 6.13). 

Create a new tree T’(x) from T(x) by removing the edges vw and zy, inserting 
a length 2 path between x and y, and identifying u and w (Figure 6.13). Note that 
T'(x) has the same height and order as T(x). The number of subtrees containing x 


in each is 


Fr(a)(«) = Fr, (x) [1 + (1+ Fray(y))[é + (& + 1)7]], 


Frv(a)(t) = Fp, (2)(2 + Frey(y)) (2 +244). 
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Figure 6.13 Transforming T(x) into T’(a) when degr(v) = 3 in 
the proof of Lemma 6.35. 


By Lemma 6.32, the height of each leaf in T is h, hence V(T(y)) > &; + £2. Now we 


have 


Fray(t) — Frvay(0e) = Fra (0) [0+ Froy(u))20 — 1) — (G+ & + 1)] 


> Fr,(x) [1+ 01+ 6)(24 -1)-(G+6+ 1] (6.13) 


= Fr (2\(@ + 261 bo + Ly = 2b» = 2) 


> 0. (6.14) 


When either (6.13) or (6.14) is strict inequality, we have a contradiction to the op- 
timality of T. Equality holds exactly when @, = ¢2 = 1 and |V(T(y))| = @: + 2. In 
other words, T'(y) is a single path on two vertices with y having height h — 1. Since 
T’ (constructed from T by replacing T(x) with T’(x)) is an optimal tree, if x is not 
the root, degr:(x) < 3 since x has height h — 3. Therefore degr(x) < 2 as described 
in the second property of the lemma. 

If T’ falls into the second category listed in Lemma 6.33, then consider a subtree 
T; with a vertex v of degree 4 at height h — 1. We will show that all other non-root 
vertices in T;U{p} must have degree 2. Suppose to the contrary that v has an ancestor 
z of degree 3. (In this way, we are able to simultaneously handle the case when there 


are two vertices of degree 4 in T; U {p} because they would have to share a common 
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ancestor of degree 3.) Label the vertices as before with s being the third child of uv 
(Figure 6.14). 


Create T’(x) by altering T(x) in a manner similar to that described above. Define 
T(x) = T(x) — wv — yz +2w + wy 


as shown in Figure 6.14. 


T(z) T"(x) 


Figure 6.14 Transforming T(x) into T’(a7) when v has degree 4 in 
the proof of Lemma 6.35. 


Let ¢2 be the distance from z to v in T(x). Because all leaves have height h, 
Fry(y) = 42 +1 which is tight when T(y) is a path. Now if we calculate Fy(.)(x) 


and Fp,)(x) exactly and take their difference, we find 


Fr(z)(t) = Fr,(a) (1 + (1+ Fray (y)) (2 + 8) 
Fria)(t) = Fr, (2)(2 + Frey(y))(é2 + 5) 
Frie)(x) — Frat) = Fr, (x) (8Frq)(y) — & — 1) 
> Fr,(x) (8(l2 +1) — & - 1) 
= Fr,(x) (2l2+ 2) 
> 0. 
This contradicts our choice of T. Thus T;U {p} can have at most one vertex of degree 


4 and all other non-root vertices must have degree 2 as described in the third property 


of the lemma. r 
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Once again, it is useful to note that the optimal trees described in the second 
two properties of Lemma 6.35, the proof described T’ analogues which have the same 
number of root-containing subtrees and yet fall under the first property description 


in Lemma 6.35. This gives the following observation. 


Observation 6.36. There is an optimal tree with each T; U {p} having at most one 


non-root vertex of degree 8. 


Define the f-split to be the tree rooted at v; with h+ f vertices, constructed from 
paths P; = (v1, v2,...,vpn), and Py = (ui, ue,...,us) by adding the edge ujup_y. We 
also define the 0-split to be merely a path on h vertices which is rooted at one end. By 
Observation 6.36, there is an optimal tree so that for each TJ; there is0 << kj <h—-1 
such that 7; is a k;-split. First let us state a structural observation that will minimize 


some notation. 


Observation 6.37. In an optimal tree T, the number of root-containing subtrees in 


a k,-split together with root p ts 
Sn(ki) = h+k? +k, +1. 


This definition also makes sense for the 0-split, which has h root-containing subtrees, 


and the h-split, with h? subtrees that contain the root. 


Lemma 6.38. Among the T; subtrees in an optimal tree, at most two of them can be 


0-splits. 


Proof. Suppose, for contradiction, that T;, T; and Tj, are each 0-splits in an optimal 
tree. Consider S := T; UT; UT), U {p}. Create S’ from S by replacing T; with a 


1-split, 7; with an (h — 1)-split and deleting Tj, (Figure 6.15). 
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Figure 6.15 Trees S and S’ from the proof of Lemma 6.38 


The difference in the number of subtrees is 


Fs(p) — Fs:(p) = (sn(0))® — sn(1)sn(h — 1) 
= (h+1)? —(h+3)(2h+4 (h —1)?) 
= 2(h—1) 


> 0. 


This contradicts the optimality of TJ’ because the tree obtained from T by replacing 


S with S’ has fewer root-containing subtrees than T. : 


Lemma 6.39. If some T; is a 0-split, then for each 7 41, T; is either a 0-split or a 


1-split. 


Proof. Suppose instead that T; is a 0-split and T} is a kj-split where 1 << kj <h—1. 
Let S be the tree induced by T;, T; and r. Construct 5" from S' by replacing T; with 


a 1-split and replacing T; with a (k; — 1)-split (Figure 6.16). 
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Figure 6.16 Trees S$ and S” from the proof of Lemma 6.39. 


Note that S’ has the same height and order as S$, and 


= (h+1)[h+ kh} +k; +1] - (+3) [A+ (kj - 1)? + by] 


=(h 1) [h k2 + ky 1] —(h 3) [h kj — ky +1 


= 2hj(h +1) -2[h+h?— kj +1] 
=2|kjh— ke + kj -—h+kj-]| 

= 2[(kj — 1)(h — ky) + (kj — D] 

a (for k; > 1) 


This contradicts the optimality of T’ because the tree obtaining from T by replacing 


S with S’ has fewer root-containing subtrees than T. : 


Lemma 6.40. A rooted tree T is not optimal if for any T; (kj;-split) and T; (k;-split), 


we have ki(1+k;) >h+1 forl<k, <k; <h-1. 


Proof. Define T2 be the subtree of T’ which consists of the root p together with T; 
and 7; 

Construct T) from 7) by replacing T; with a (k; — 1)-split and replacing T; with 
a (k; + 1)-split. This construction is well-defined because 1 < k; and kj <h—1. 


It is easy to see that T; has the same height and order as Tm. We have 
Frry(p) = sn(ki)sn(kj) = and = Fry (p) = Sn (ki — 1) 8a (ky + 1). 
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Since k; <k; and k;(1+k;) > h+1, we have 
Fp, (p) — Fry(p) = —2(ki — ky — 1) (ki + kikj — hb — 1) > 9, 
which contradicts the optimality of 7’. : 


By reversing the roles of k; and k; in the previous lemma, we obtain the following 


corollary. 


Corollary 6.41. A rooted tree T is not optimal if for any T;, which is a k;-split, and 


T;, which is a kj-split, we have kj(1+k;) <<h+1 and k; < kj; —1. 


Corollary 6.42. Fix an optimal tree T in which each T; is a k;-split with ky > kjx4. 
he > ~ate- *, then ky < \/h+ 3 = 5 for each i > 2. 


Proof. Let T be an optimal tree, as described in the corollary, with ky > \/h + 3 = S. 


For contradiction, suppose ky > \/h + : = 5 Observe 


ko(ky +1) > (Jv 8-3) (Jar S 3) 


el rR 


However, this contradicts the statement of Lemma 6.40. Therefore the corollary 


holds. r 


Corollary 6.43. Fix an optimal tree T in which each T; ts a k;-split. For any pair 


{ki,kj} with ki, kj < \/h+2— 5, we can conclude |kj — kj| < 1. 
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Proof. Suppose k;,k; < /h+ 3 = 5 with k; > kj + 2. Observe 


kj(1 +k) < ky(kjy — 1) 


(er) 


5 5 3 
as SoD dhe 
4 Nba 
5 
=h+2-Alht+— 
aa 
<h+l1 


This contradicts Corollary 6.41, finishing the proof. 


Question 6.44. Let root p have deg(p) =r and let T; be a k;-split where0 < ki < h 


for each i € |r]. If T is an optimal tree, then 


waka Lam, 


kisteky Shy POR Le a4- SNE Uae, 


Our goal is to minimize the expression 


k 
[[@+k +e 4+), 


i=1 


obtained from the number of root-containing subtrees of an optimal tree illustrated in 


Figure 6.17. 


While we do not yet have a complete characterization of optimal trees, we have 


many necessary properties. 


Lemma 6.45. Suppose ky > ky >... > ky. If ky > [até ae 3], then for each 


a>, 
b+] yep c Atl 
ky ky +1 
In particular, ky = kz =... = ky | S| provided Eeesa cl 
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Figure 6.17 The structure of a tree T with height 
h and order n which minimizes the number of 
root-containing subtrees. 


Proof. Since ky > \/h+ 3 — s, Corollary 6.42 implies kz < \/h + 2 — 7 Since T' is 
optimal, Lemma 6.40 yields k;(1 + ky) < ko(1+k,) <h+1. Thus 


ae n+l 
heed 


Because k, > /h + 3 - 3]; then necessarily kp < k; — 1. Corollary 6.41 gives 
the following: 
ki(l+ki) > kil +k) >h+1. 
This is equivalent to 


h+1 


1. 
ky 


k, > 


Remark 6.46. Let T be an optimal tree with n vertices and height h such that each 


T; is a k;-split with ky > kp > ... > k, which falls into one of the following three 


categories: 
1. (Paths) ky =0, kp-1 € {0,1}, Hg =... = ky = 1. 
2. (One large) ky > | h+3- | and k; = S| for eachi € {2,...,r} provided 
S| Z ae 7 
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3. (Even distribution) ky < |,/h+2—4]| and for alli,j € [r], |k; —k;| <1. 
£35 j 


Provided that the k;-values in an optimal tree follow an even distribution, we are 


interested in determining the optimal value for deg(p) = r. 


Lemma 6.47. For fixed h,n € Z, let T be an optimal tree with n vertices, height 


h, and where each T; is a k;-split. Fix t € R, t > 2 which satisfies the inequality 


AVY) > In(6h). For x € [Ave+D, ni] with n > (h+2)(h+a—1) +1, then 
l{i:k; =xz}|<h+a-1. 


Proof. Let T be a tree with root degree r and each T; is a k;-split. Suppose for 
contradiction that ky = kp =... = kp42-1 = x (where the k; values are not necessarily 
in non-increasing order). 

Let H be the subtree induced by 7),...,7h4,—-1 and the root p. Here, each T; is 


an x-split. Thus the number of root-containing subtrees in H is 
Fy(p) = (h+a?+a4+1)"2-1, 


Let TY be an (a — 1)-split. Define a new tree 7” by replacing T; with T/ for each 
i € [hk + x — Ij and increasing the degree of the root by one so that the new branch 
Ty is also an (x — 1)-split. Let H’ be the subtree induced by 79, 7j,...,7j,4,—, and 


the root of JT’. The number of root-containing subtrees in H’ is 
Fip(p) = (h+2? —2+1)"**, 


In order to compare the number of root-containing subtrees of T and T”, it suffices 


to compare the number of root-containing subtrees of H and H’. 
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In order to compare these, consider the ratio: 


Fr(p) _ Fu(p) 

Fr(p)  Fur(p) 
(h+ x? tat Uh a 
(h4+a2?—ax+41)h+* 


_ 1 (1 | an yo 
h+a?—a2+1 “A+e? —2+1 

l op i/(t+1) Athi/(t+1) 
=, h2/t hi/t + J ( h as h2/t _ hi/t+)) ate :) 


since x € [AV(4), pl] 


1 2 
pz/t+ Alte +1 (1 | At/t+) 4 Alt+2/(P+t) — 1 4 A-1/+D) 


h+ 
fi 2 h 
2 1 
— 3h ( At/(t+)) + aicanea) 
1 9) pt/(t+)) yl /(t+) 
2 1 
~ 3h ( Ai/(t+1) + aan) 
1 1 ht/(E+1) pl /(t+1) 
~ 3h ¢ ware) 
st Laven 
ore 2 
Pall 


) 


hth/(t+1) 


for t > 2 


since hV/(4+) > In(6h) 


As a result, JT’ is not an optimal tree because T” also has n vertices and height h 


but has fewer subtrees which contain its root. 


We obtain the following corollary: 


Corollary 6.48. Fix h > 550 and n € Z with n > 6h?. Let T be an optimal tree 


with subtrees T; which are k;-splits. For any x € (In(6h), vi , then 


l{¢:k =a2}|<h+a-1. 


Proof. Fix n,h with n > 6h?. Since h > 550, In(6h) < h'/3. Suppose a tree T has 


root degree r. Let x € (In(6h), Vhi. If In(6h) < x < h”, there is t € 
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R such that 


= AV) ¢ = PR—-1) andt> 2. If kV <2 < hi? , let ¢ = 2 in which case 


Ing 


AVG) — f1/3 > In(6h) for h > 550. 7: 


In the statement of Corollary 6.48, we require n > 6h?. First this ensures that T 
can possibly have (h + a — 1) subtrees which are 2-splits for  < Wh. On the other 
hand, Corollary 6.43 says |k; — k;| <1. Ifn > 6h?, then we guarantee that T will 


have at least h + « — 1 subtrees T;, each of which is an z-split, even when x = Vh. 


Remark 6.49. Therefore ifn > 6h? with h > 550, then each k; < In(6h) +1. Even 


in the “one large” case, the value of ky must be large enough so that kj = Fest << 


ky+1 


In(6h) + 1. 


While we do not yet have a complete characterization of the optimal trees, we 
have established many of their structural properties to guide our continued study on 


this topic. 
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CHAPTER 7 


SOME REMARKS ON BARANYAI’S THEOREM 


About 40 years ago, Baranyai (1973) found a proof using network flows for a long- 


standing open problem about set partitions: if k|n, then all () k-element subsets 


n-1 


a) families, such that each family is a partition of the n- 


can be partitioned into ( 
element underlying set. The proof uses a polynomial time algorithm to develop such 
a partition. However, the algorithm sheds little light on the construction. Before 
Baranyai’s proof, the existence for k = 2 was well-known and easily constructible as 
seen in Fig. 7.7 taken from Lint and Wilson (1996). Peltesohn (1936) proved the 
existence for k = 3 by combinatorial arguments. For k = 3 an algebraic construction 
also exists due to Beth (1974). While no explicit construction has been found for 
larger sets, such a construction may prove enlightening about finite sets as Baranyai’s 
theorem is a strong result. For example, it implies the Erdds-Ko-Rado theorem in 
one line for kn. 

Erdés and Székely (1989) was the first to establish a bijection between set par- 
titions and rooted leaf-labelled trees. It is our hope that working with trees instead 
of partitions will provide the insight needed to develop a more general construction 
of Baranyai partitions. We use here another bijection found by Stanley (1999) Ex. 
5.43 to provide an alternative construction for the k = 2 case of Baranyai’s Theorem. 
Eventually one could describe our constructions without trees, but we think that 
trees give a strong motivation for them. We also prove that our construction differs 


from the standard construction shown on Fig. 7.7. 
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7.1 BARANYAI’S THEOREM 


In order to state Baranyai’s Theorem, we first need the following definitions. 
Definition 7.1. Given the set [n] := {1,2,...,n}, and k € [n] 
1. Gy is the collection of subsets of |n] having size k. 


2 ioe (al) is a k-partition of |n| provided the elements of S form a partition of 
[n]. 


Theorem 7.2 (Baranyai, 1975). If k divides n, then there is a partition of Cc) 
oT = k() rows, each of which is a k-partitions. We call this an (n, k)-Baranyai 


partition. 


Baranyai proved the existence of these (n,k)-Baranyai partitions for integers k 
and n with k|n. We now turn our attention to the case when k = 2 and work toward 


a constructive proof. 


7.2. BIJECTION WITH BINARY TREES 


Definition 7.3. A binary rooted tree ts a vertex rooted tree where each vertex has 
0 or 2 children, including the root. One that is labeled bears distinct labels for the 
leaves. If it is unordered, there is no ordering given to the two children of a single 
vertex. 

Let B, be the collection of labeled, unordered, binary rooted trees with n non- 
root vertices. Each will have €, leaves and l, — 2 non-root, non-leaf vertices. Thus 


n= ln + (ln — 2). The leaves will have label set {1,...,£n}. 


Stanley defines a bijection between B,, and the 2-partitions of [n]. The bijection 
is defined as follows: Start with T € B,. Extend the labeling to all of the non-root 


vertices by iterations of the following procedure. 
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Algorithm 7.4. Among the vertices which are unlabeled, non-root vertices with chil- 
dren which are both labeled, chose the one whose child has the least label of those 
considered. If labels 1,...,s have already been used, apply the label s + 1 to the 


selected vertex. 


This will result in a labeling of the whole tree with labels 1,2,...,n as exemplified 
in Fig. 7.1. Since we have a binary tree, we can create a 2-partition of [n] by pairing 
the labels of vertices which have the same parent. Because the tree is unordered, the 
2-partition will be unique. 

Likewise, each 2-partition can be described by a unique tree in B,. Induce an 
ordering on the pairs based on the maximum element in each. Start with a root 
vertex and two leaves. These leaves will obtain the labels from the first (largest) pair, 
i.e. the one containing n. Below the leaf with label n — 7, hang two new leaves with 
labels from the i + 2 pair in the ordering, starting with 7 = 0. Ignoring the non-leaf 
labels in the final product, we have a tree in B,. Mutatis mutandis, the bijection 


extends for non-binary trees. 


gs a 


NL LL 
\ ri 


Figure 7.1 A leaf-labeled binary tree which corresponds 
to partition 13]24|56|78. 


7.3 TREE CONSTRUCTION 


For even n, the goal is to describe a collection of 2(3) = n-—1 binary trees which 


correspond to an (n, 2)-Baranyai partition. The construction is inductive on n. Asa 
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a 


1 2 


Figure 7.2. The single 
binary tree for the 
(2, 2)-Baranyai partition. 


base case, when n = 2 there is only one binary tree (in Fig. 7.2) with leaves labeled 
{1,2} corresponding to the unique (2, 2)-Baranyai partition. 

Assume for all even m < n there is a collection of m — 1 trees from B,, which 
corresponds to an (m, 2)-Baranyai partition by the bijection described above. Recall 
that each tree in the collection will have m non-root vertices and £,, leaves labeled 
1,...,4m. Extending the labeling, the interior vertices take labels @,, +1,...,m. For 
ease of notation, let 7;, name this collection of m — 1 fully label trees. 

In the inductive step, we create 7,,, a collection of n — 1 trees from 6, whose 
corresponding partitions create an (n,2)—Baranyai partition. The construction is 


broken into two cases 


(Case 1) n = O mod 4 


(Case 2) n = 2 mod 4 


Case 1: n=0 mod 4 (ie. n = 27 where j is even) 

We require n — 1 binary trees with @, = 7 + 1 leaves and j — 1 non-root internal 
vertices. The method to extend labelings guarantees that each tree will correspond 
to a 2-partition of [n]. Because n is finite, it suffices to check that every element of 
(oh appears in some partition. The result will be an (n,2)—Baranyai partition. 

The collection 7;, will be the union of two sets. The first collection, 7,/, will consist 
of 7 — 1 trees which account for all the 2-subsets of [n] in which both elements come 


from {1,...,7} or both elements come from {j + 1,...,n}. The second collection 


T,”, of j trees will exhibit all the 2-subsets with one element from {1,...,7} and the 
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other from {j + 1,...,n}. Thus 7,! U 7,2 will be the desired collection of n — 1 trees 


in which all 2-subsets of [n| appear in exactly one tree. 


Creating T,,: Start with Jj, the collection of j—1 trees from the induction hypothesis. 
The non-root vertex set takes labels 1,...,7 while the leaves are labeled 1,...,@;. 
Increase each of the labels by 7. The vertex labels are now j +1,...,n and the leaves 


are labeled 7 + 1,...,7 + €;. Hang cherries (pairs of leaves) from the existing leaves 


with labels {j7 +2,...,j7+4,;}. This creates 2(¢; —1) = j new leaves which need labels. 
Let T} name this collection of partially labeled binary trees. Fig. 7.3 demonstrates 


the process on a tree from 74. 


Figure 7.3 The left tree is from 74. Adding 4 to each of 
the labels yields the middle tree. Finally we hang cherries 
from 3 of the leaves to obtain the right-most tree in 7]. 


Each T” € 7; was constructed from a tree T € 7;. For the 2-partition that 
corresponds to TJ’, induce a well-ordering on the pairs according to their least element. 
The pair containing 1 will be the first (least) pair. We then use these pairs to label 
the unlabeled leaves of T’. Specifically, the ith pair will label the children of the 
vertex labeled 7 + 7+ 1. Labeling in this order respects the algorithm for extending 
labelings. 

By construction, this collection of 7 — 1 trees accounts for all of the ce ) 2-subsets 


2 


of {1,...,7} and all of the (3) 2-subsets from {j + 1,...,n} with i of each type in 


each tree. 
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Creating T,?: We utilize j caterpillar binary trees (as seen in Fig. 7.4) to account for 
the pairs in which one element is from {1,...,7} and the other is from {j+1,...,n}. 
On every tree, apply label 7+1 to one of the leaves farthest from the root. Necessarily, 
the internal vertices are labeled 7 + 2,...,n increasing toward the root. 

For the complete bipartite graph with vertex classes {1,...,j} and {j+1,...,n}, 
we can partition the edges into 7 perfect matchings since the graph is regular. Using 
one matching for each caterpillar tree, label the vertices so that pairs of vertices 
with the same parent are exactly the pairs in the matching. This completes the 


construction of 7,?. 


/\ re 


NLS 
NA Ly 
NLS 


Figure 7.4 The left tree is the general 
caterpillar tree. The right tree corresponds to 
the matching 35|26|47]18. 


Between 7,' and 7,?, we have constructed a total of 7 -1+ 7 =n—1 trees with 
each pair from co appearing in exactly one tree. This completes our construction 
of 7, when n = 0 mod 4. 

Case 2: n=2 mod 4 (ie. n = 27 where j > 1 is odd) 

We need n — 1 trees from B,, each with 0, = 7 +1 leaves and j — 1 non-leaf, 
non-root vertices. As in Case 1, this is done in two steps. First, we create 7,! 
with 7 — 2 trees to account for all pairs from {j + 2,...,n} and all but 7 + 1 pairs 


from {1,...,7 +1}. The missing 7 + 1 pairs make two 2-partitions P, and P, of 
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{1,...,j7 +1}. Second, we create 7,? to account for the pairs from P, and P, in 
addition to the (j + 1)(j — 1) pairs in which one element is from {1,...,7 +1} and 


the other element is from {7 + 2,...,n}. 


Creating T,|: Because j — 1 is even, the induction hypothesis gives the collection Tj_1 
of 7 — 2 trees, each with ¢;_; leaves. We now modify these trees to create trees with 
nm non-root vertices. 

On each tree of 7;_;, increase all of the labels by j + 1 so that the new labels 
are 7 +2,...,n. At the end of each leaf, hang a pair of leaves. This yields exactly 
Dh ihe 2(+ +1) = 7+ 1 new leaves for a total of n non-root vertices. Call this 


collection of 7 — 2 trees Tj-1- Figure 7.5 represents the procedure for n = 10. 


ee ee ee 
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Figure 7.5 The left tree is from 74. The middle tree 
results from increasing the label values by 6. The right tree 
is the outcome in 7 after hanging 3 pairs of new leaves. 


It remains to label the 7 + 1 new leaves. Because 7 + 1 is even, the induction 
hypothesis supplies 7;,1 corresponding to an (n,2)—Baranyai partition. Let T,, and 
T, be two arbitrarily chosen trees in 74; with corresponding partitions P, and P). 
Fix any bijection from 7j_, to Tj41 \ {Ta, 7p}. Consider an arbitrary pair (7", 7) in 
the bijection, J” € Tj-1 and T’ € 7j4;. Induce a well-order on the pairs associated 
with T according to the least element in each pair. Now in TJ”, label the children of 
the vertex labeled 7 + 1 +7 with the values in the 7th pair where the pair containing 
1 is the first pair. This algorithm results in a labeling of all the leaves of J’ that 


respects the method for extending labelings. 
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The construction of 7,! is complete with j — 2 trees containing (75') pairs from 
n 2 
{j+2,...,n} and (j — 2) pairs from {1,...,7 +1}. The remaining 7 + 1 pairs 


from {1,...,7 +1} are those in P, and P;. They will be accounted for in 7,?. 


Creating 7,2: The remaining pairs will be realized in j + 1 caterpillar binary trees. 
The internal vertices will necessarily be labeled 7 + 2,...,n, increasing as we move 
closer to the root as in Fig. 7.6. 

We can use matching theory to label the leaves of the caterpillar trees. First, 
create a complete bipartite graph with vertices {1,...,7 +1} on the left and vertices 


{j+2,...,n,x,y} on the right. 


Figure 7.6 The left caterpillar is the general layout for n = 10 
with leaf labels needed. The middle bipartite graph shows a 
perfect matching which extends the matching {3, x}, {1, y} for 
{1,3} in P. The right tree is the caterpillar corresponding to 
the matching. 


For P = P,UP,, start with a single pair {k, 2} € P. Match k with x and @ with 
y. The remaining graph is regular bipartite, so we can extend our matching to a 
perfect matching M which will determine the leaf labels of a single caterpillar. For 
alli € {j7 + 2,...,n}, the vertex matched with 7 will have the same parent as 7 in 
the caterpillar. The vertices matched with x and y will label the two leaves farthest 


from the root. 
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Removing the edges of MW from the bipartite graph, regularity is maintained. So 
we repeat the procedure to obtain the next caterpillar tree. You may notice that each 
value of {1,...,7-+1} appears in exactly two pairs of P. So for {@,m} € P, @ will be 
paired with x since it was previously paired with y. Thus there will be no problem 
with representing all pairs of P. Once the bipartite graph has been decomposed into 
j +1 matchings, we will have our j + 1 caterpillar trees, completing our construction 
of 7,?. 

With 7,/ and 7,7, we have constructed (j — 2) +(j +1) = 27 -1=n-—1 trees for 


the (n,2)-Baranyai partition. 


7.4 NEW (n,2)—BARANYAI PARTITIONS 


The easy known algorithm to create (n,2)-Baranyai partitions is as follows: 


Algorithm 7.5. Choose one element of {1,2,...,n} to be the center. Display the 
remaining n —1 values equidistant in any order on a circle around the center. The 


procurement of the partitions is best described first with an example. 


In Fig. 7.7, we have 8 in the center with {1,2,...,7} in increasing order around 
the circle. Connecting 8 with 1, the remaining integers are paired as in the left 
diagram to create 2-partition {{1,8}, {2,7}, {3,6}, {4,5}}. By connecting 8 with 
2 and matching the remaining integers in a similar way, we obtain the 2-partition 
{{2, 8}, {1, 3}, {4, 7}, {5, 6}}. Continue to rotate the pairing configuration around 
the circle to obtain 7 different 2-partitions which contain each element of (3) exactly 


once. This is an (8, 2)—Baranyai partition. 


Algorithm 7.6. In general, to obtain a partition into 2-sets, match the center with 
an arbitrary entry on the circle, and after that match those pairs whose connecting 


line is perpendicular to the radius connecting the center to the selected entry. 
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Figure 7.7 A circular representation to find 
an (8, 2)-Baranyai partitions (Lint and Wilson 
1996). 


Claim 7.7. The construction developed in this paper does not arise from the circular 
construction described here for any 4 <n 4 10, for any relabeling the elements on 


PYG TY. 


Proof. We will break this proof into two cases according to the two cases for the tree 
construction. 

Case 1: When n = 0 mod 4, n = 2, represent the elements from {1,...,7} with 
color blue, b, and the elements from {j + 1,...,2} with color red, r. Note that blue 
and red occur the same number of times. Recall each 2-partition in 7;, from the tree 


construction is one of the following types: 


Type I: All pairs have one element from the set {1,...,j7} and one element from 


{j+1,...,n}, ie. all pairs are of the form rb. 


Type II: Half of the pairs have both elements from {1,...,7} and the other half of 
the pairs have both elements from {j + 1,...,n}, i.e. half of the pairs are bb 


and the other half are rr. 


For contradiction, assume there is a circular representation of {1,...,n} whose 
corresponding 2-partitions exactly match those of 7. 
At this point, we assume the integer in the center is red. (Identical arguments 


work if the integer in the center is blue.) It is also safe to assume that there are two 
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consecutive positions on the circle, one red and the other blue (here we use n > 2). 


The following deductions can be seen Fig. 7.8. 


r ba 
7) 
C3 
r ey 


Figure 7.8 For Case 1, this shows the steps to determine the colors 
on circle. 


The 2-partition that arises when the center r is paired with the b must be of Type 
I. In this same partition, r will be paired with c,, forcing it to be blue because this 
is Type I. 

Now let’s consider the next 2-partition obtained by pairing the middle r with blue 
cy. This again puts us into Type I. Based on the circular arrangement, c2 will be 
paired with b and thus must be red. 

Looking at one more 2-partition, where the middle r is paired with red co, we 
find ourselves in Type II. Because blue c; will be paired with c3, it must be blue. 
Similarly, c, is blue. 

The pattern rbbrbbr ... repeats around the circle. However, half of the elements 
should be red, posing a contradiction. 

Case 2: When n = 2 mod 4, n = 2), color the elements of {1,...,7 + 1} blue, 8, 
and the elements of {7 + 2,...,n} red, r. This time, there are two more blue than 


red. Each 2-partition of [n] in the tree construction is one of the following types: 


Type I: There is one pair with both elements in {1,...,7 +1} and the rest have one 
element from {1,...,j7 +1} and one element from {j + 2,...,n}, ie. one is bb 


and all others are rb. 


Type II: All pairs have either have both elements from {1,...,7+1} or both elements 


from {j + 2,...,n}, ie. all pairs are rr or bb. 
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Because there are 2 more blue integers than red, we must find the sequence rbb 
somewhere on the circle. This time we split the problem into two cases based on the 
color of the middle integer. 

Case 2/Red: Suppose the middle integer is red. When the middle r is paired with 
the r on the circle, as in Fig. 7.9, we have a Type II partition. Thus d, and dz are 
blue. Now c, could be red or blue. So we consider the two possibilities separately. 


pa oe no 


b = dy C2 
r C3 
\ se 
Figure 7.9 In Case 2/Red, we start 


with a red middle. By pairing r with 
r, we obtain more color information. 


If c, is blue, then pairing the middle r with the b as in Fig. 7.10, we find ourselves 
in Type I. There is already a bb pair, so Cy is blue and c3, c, are red. However, when 
we pair the middle r with the next b we find an rr in the same 2-partition which is 
not possible in either Type I or Type II. Thus we have a contradiction. 


b bs Db 
b ( Co=b b <a 


i c3 =7T wT 
C4 = PT T 


Figure 7.10 When c, is blue, we obtain a 
contradiction. 


bb 


If c, is red, then pairing the middle r with c, puts us in Type II. Thus cy and 
cz are blue while cq is red as in Fig. 7.11. Next, pairing the middle r with c4 puts 
us in Type II again and continues the rbbrbb pattern around the circle. Repeating 
this procedure, we see that there are twice as many blues as reds on the circle which 


contradicts the size of the red and blue sets for n ¥ 10. 
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Figure 7.11 When c, is red, these are the 
deductions made. 


Case 2/Blue: When the middle integer is blue, the color of c; is not yet determined. 


So we consider each color possibility separately. 


Figure 7.12 The set-up 
for Case 2/Blue. 


If cy, is red, pair the center b with the two blues on the circle, one at a time as in 
Fig. 7.13. Both are Type I with the pair containing the center as the exceptional bb 
pair in each. All other pairs in these will be rb. The first forces d, to be blue while 
the second makes cz blue. Using this new information, return to the first pairing to 
see dz is red and the second pairing to see cz is red. In this way, we find that the 


colors alternate around the rest of the circle. 


b b rbo 
a ae a i } r 
2 C2 2 ( @Q= 
ds b C3 ds b C3 p~ : ) 
dy C4 ee b e ae 


Figure 7.13 When c, is red, the first two pairings work 
alternately to obtain the configuration on the right. 


Connecting the center b to the red on the opposite side of the circle, as in Fig. 


7.13, we find a single partition with both rb and rr which poses a contradiction. 
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If, on the other hand, c; is blue. Pairing the center 6 with the middle blue on 
the circle (left picture of Fig. 7.14) puts us in Type II because there will be two bb 
pairs. Therefore cz is red. Next, pairing the center b with the rightmost blue puts 
us in Type I because there is an rb pair. Since there is already a bb pair, all the rest 
must be rb. Therefore c3 will be red and cy will be blue. 

Back to the first pairing, we can conclude d, is red. And the second pairing gives 
cs is blue. However, as in the last picture of Fig. 7.14, there is now a pairing with an 


rb and two bbs which is neither Type I nor Type II posing another contradiction. 


r ab r b b r 0 b 
dy i ~b dy aA r i 7 b 
‘C= “oT eet 
bog Y @=r ov’ r 
3 3 \ 
C4 C4 = b oe 6, 
C5 C5 b 


Figure 7.14 When c, is blue, these conclusions can be made. 


This completes the proof that the 2-partitions constructed using the tree algorithm 


are distinct from those obtained by the circular arrangement algorithm. rT 


7.5 CONCLUSION 


There are many different bijections between trees and partitions. As we continue 
on this project of constructing Baranyai partitions, we explore other bijections in a 


search for constructions which could be extended for larger values of k. 
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APPENDIX A 


METHOD FOR SELECTING THE BINARY STRINGS IN C; 


For a D3CNF I’, Subsection 2.1.1 defines a set C; of 50 binary strings that had number 
of useful properties needed to encode clauses of J’. In this section, we explain the 
method by which we selected these 50 strings. 

Fix I’ with k clauses and n literals. Fix a clause c; = vg V ug V vy in I’ with 
a#BA#Ax7#a. The goal is to define a multiset of binary strings C; = {vi it, taken 
from the set 

3 = {0,1} 
with coordinates 


(x1, Y15 V2, Y2,-+ +5 Un, Yn, €1, €2,---; €;) 
so that the multiset has the following properties: 
Li MGA 5 eet a a0, 1 OY 
2. For any pp’ € Mi, and 7,7! € M'\ M{,, then 
{H(u,v8) : 5 € [ml} = {A (u,v) : 7 € ml}, 
{H(n, vj): 9 € [m]} = {A (n!,v%) +9 © [m}, 
{H (1,5) 9 € [m]} A {A (n, v4) 27 € [m]}. 
If we can find the appropriate strings for a clause with 3 positive literals, then we can 
adapt these to make strings for clauses with negative literals as explained in the last 


few paragraphs of Section 2.1.1. For the remainder of this appendix, we restrict our 


attention to a clause c; with three positive literals. 
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For vt € C;, define 


Vi[Si] = (Yj[to], Yilyol Yjlea], Yilyal, Yjley1, ¥j[¥4])- 


For a median p € M'(vi,...,v*,), define p[S;] similarly. 

We say that two strings 7 and 7 are complementary on the first 2n coordinates if 
nixj| = 1—7l[x,] and n[y;] = 1—7[y;] for each i € [n]. To achieve our first goal, strings 
will be added to C; in pairs which are complementary on the first 2n coordinates. 
Thus x; and y; will be ambiguous coordinates for all 7 € [n]. The e; coordinates 
will be for additional ones, so ple;| = 0 for all  € M and all j € [t]. Therefore 
NA SSO OF. 

For any wp € M’, lx] A uly,] for each 7 € [n], so u[S;] € {01,10}*. The eight 
possible 6-bit strings can be visualized on the 3-dimensional cube. Label the vertices 
with these strings so that the Hamming distance between two strings is precisely twice 
the graph distance between the vertices they label. This is represented in Figure A.1. 
We often use the representation on the right in Figure A.1. 

In Definition 2.1, we defined a bijection between M’ and truth assignments. The 
clause c; is not satisfied if all of its variables are false. Since c; has 3 positive literals, 
this truth assignment corresponds to uw € M’ with y[S;] = 010101. But if u[S;] is any 
of the other 7 tuples then the corresponding truth assignment satisfies c;. Let uy be 
the vertex labeled 010101. 

Using uy as a reference point, we say that the height of a vertex is its graph 
distance from uy. In particular, the vertices labeled 100101, 011001, or 010110 have 
height 1. We will also refer to the medians pp € M’ with p[5S;] equal to one of these 
as having height 1. The vertices labeled 101001, 100110,011010 have height 2 and 
the medians pp € M’ with p[S;] equal to one of these three 6-bit strings have height 
2. Medians p € M’ with pu[S;] = 101010 have height 3 while medians np € M’ with 
u[S;| = 010101 have height 0. 
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uf 


Figure A.1_ A labeling of the 3-dimensional cube 
with the possible values of y[5;]. The vertex uy will 
correspond to the medians jo with pu[.S;] = 010101. 
We use the figure on the right to represent the cube. 


Next we work toward the second goal of distinguishing the multiset of Hamming 
distances for medians in M/,, from the multiset for medians in M’\ M/,. To simplify, 
we make the restriction that each v% € C; will be taken from the collection S’ ™ S’ 


where 
S' = {n € S: nlx] = nlye] = 9 VE € [n] \ {a, 8, y}}, 
S = {7 € 8: qlee = nly] =1 Vee [nr] \ {a, 8, y}}- 


For wp € M’ and vi € S'S", for each ¢ € [n] \ {a, B, 7}, 


A((v5 [xe], vi [yel), (u[xe], w[ye])) = 1. 


This is because v%[a~] # vi[ye]) while [xe] = pyc]. Consequently, if vi has e(v4) 


additional ones, then 
A (vj, u) = (n— 3) + H(v;[Si], u[Si]) + e4). 


Each 7 € S has a string 7’ € S’ which is complementary to it on the first 2n 
coordinates. (The choice of 7 is not unique.) Following the decision that the strings 
in C; will be in pairs which are complementary on the first 2n coordinates, we will 
select our strings from S$’ for C; and then include a corresponding string from S’. 

Now we analyze the strings from S’ to inform our decision on which ones to include 


in C;. In order to characterize a string vt € S’, we only need to specify v [S;] and the 
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Table A.1 Hamming distances when 
n € N, with n[x.| = 0 and niyo] = 1. 


LS) 
010101 
100101 
011001 
010110 
101001 
100110 
011010 
101010 


= 
= 
w 
= 
w 


AN RAND A DIE 


number of additional ones it will have (see Definition 2.4). Aside from the number of 
additional ones in each string, there are only 2° possible choices for v‘[Sj]. 
Partition S’ into sets No, Ni, No, N3 defined here. For each i € {0,1,2,3} and 


n € S’, n € N;, precisely when 


{9:9 € {a, 8, y} and y[xj] # n[y,]}] = ¢. 


We will consider the sets No, Ni, No, N3 individually. 
Set No: For € € No and wp € M’, H(E[Si], u[S;]) = 3. This is because plx,;] 4 ply;] 
while €[x;] = €[y;] for each j € {a, 8, y}. Because no distinction is made between the 


median in M’, the strings in No will not be useful in accomplishing our second goal. 
Set Ni: For 7 © Ny and w € M’, A(n[Si], u[Si]) € {2,4}. In particular, if n[r.] does 


not equal [yo], then A((u[x,], ulys]), (mlz), nly;])) = 1 for 7 € {8,7} and either 
e u[ro| = n[xo] and u[yal = nlyo] in which case H(n[S;], u[S;]) = 2, or 
© L[%a] # MCo] and [Ya] # nlYa] im which case H(7[Si], u[Si]) = 4. 


The values are detailed in Table A.1. 
We can display these Hamming distances on the cube representation of medians 


so that the vertex representing p[.S;] is labeled H(u[S;], n[S;]). For each 7 € {a, 8,7}, 
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Uf Uf 


A(ng|Si], uLSi]) A(n,[Si], wLSi]) 


Figure A.2 For j € {a, 8, y}, the values of H(n;[Si], u[5Si]) 
are displayed on the cube representation of medians from 
Figure A.1 . 


let n; € N, be the strings with 7,;{7;| = 0 and 7,[y;] = 1 and no additional ones. Set 


Ni = {na 18, Ty}. 


There are three cubes drawn in Figure A.2, one for each of the strings in N with 
vertices labeled by the value of H(,u[.S;], 7[Si]) for 7 € Ni. The left-most cube corre- 
sponds directly to Table A.1, the value of H(u[5S;],o[S;]) labeling the vertices. 


Consider the multiset 


Ay (uw) = {A (uh Si], nol Si), ACebSi], nal Sil), A (el Si], my LSe])}- 


Notice that all medians with height one have H,(u) = {2,2,4} and all height two 
medians have H,(w) = {2,4,4}. We can display these values on the median cube. 
Since vertices of the same height have the same value for H;, we write each multiset 
only once. The cube is drawn in Figure A.3. 

For each 7 € Nj, define 7 to be the binary string which is complementary to 7 on 


the first 2n coordinates and has no additional ones. Set 


Ni := {fa 78,7}. 


For Hi(y) = {H(ulSi), nal Si]), (uli mal Sil), H(ulS:], m[5%])}, we label the cube on 
the right in Figure A.3. 


Notice, however, for all pp € M’, 


A, Ay = {A (ulSi], n[Si]) :7 € MUN} = £2, 2, 2,4, 4, 4}. 
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{4, 4,4} {2, 2,2} 


{2,4, 4} {2,2, 4} 
{2, 2,4} {2,4, 4} 
a oy a "Y 
{2, 2,2} up {4,4, 4} 
{H(n[Si], u[Si]) 217 © Ni} {H(n[Si], u[Si]) +7 € NG} 


Figure A.3 The left cube displays the values of H,(,1) 
and the right cube displays the values of H1(1). 


(7,7,7} (2,2,2} {2,2,2,7,7,7} 
(5,7,7} (2,2, 4} {2,2,4,5,7,7} 
(5,5, 7} : (2,44) {2,4,4,5,5,7} 
: ¥ ; (5,5, 5} {44,4} {4,4,4, 5,5, 5} 
(H(n{S],o(8) 2 {H(mlSa (84): 
ne Ni} neM} 


Figure A.4 The values {H(n[S,], u[Si]) :n €. NV} w A (u) 
displayed on the median cube. 


So MN, UN, acts like an identity. 

Now if we give some additional ones to each of ja, 7g, 7, then we increase the Ham- 
ming distances by that amount and obtain a more useful collection. For example, if we 
add 3 additional ones to each string in NV; to make the set N{**) = {n*), ie nt}. 

Recall that for any wp € M’ and 7 € S’, with e being the number of additional 
ones in V, 


H(y1,v) = H(u[Sj}, v(S,]) + ev) + (n— 3). 


Therefore H(n*°[S;], u[.Si]) = H(na[Si], u[Si]) +3. The values of the multiset 
{A (ol Si], Si) 29 NY} w {A (alSi), wlSi]) 7 < Na} 


are displayed in Figure A.4. 


Set No: Now we turn our attention to the set No. For ¢ € No and wp € M, 
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{5,5, 5} Frost 


{3, 3, 5} fi.33h 
{1,3, 3} {3,3, 5} 
Q Y Qa Y 
ee y  15,5,5} 
A(uLSi], Ca,a[Si]) {H(u[Si], C[Si]) : ¢ € No} {H(u[Si], ¢[Si]) + ¢ € No} 


Figure A.5 The left cube displays the Hamming distances H (Si, Ca,s[Si]). 
The middle gives H(j1) and the right cube displays Ho(1). 


ACS], u[Si]) € {1,3,5}. For example, if ¢ is a string in Ng with ¢[xa] ¥ ¢[ya] and 
¢{xa] # Clya], then the following hold: 

e If u[xq] = C[xq] and pu[xg] = ¢[xg], then A(¢[S;], u[S;]) = 1. 

e If p[za] = ¢[za] and ulxg] 4 ¢[xg], then H(¢[Si], u[S;]) = 3. 


e If pra] # ¢[za] and pulxg] = ¢[xg], then A(¢[Si], u[.S;]) = 3. 


e If uta] A ¢[zq] and p[x,g] A C[xg], then A(C[S;], u[S;]) = 5. 


For each pair {7,2} C {a,6,y}, j  &, let Gje be a string in Nj which satisfies 


¢e[rj] =0 = 1— Gey] and Cj e[ze] = 0 = 1 — Cj [ye] and no additional ones. Let 


No = {Ca,85 Gayys Ca,y}- 


The Hamming distances of medians with these three strings are displayed on the 
three cubes in Figure A.5. 

Define the multiset Ho(w) = {H(¢[Si], u[Si]) : ¢ € No}. The medians at height 1 
on the cube all have H2(j) = {1,3,3} and the medians at height 2 on the cube all 
have Ho(1) = {3,3,5}. This is displayed on the middle cube in Figure A.5. For each 
{j,£} C {a, 6,7}, define 77¢ to be the binary string which is complementary to 4 on 
the first 2n coordinates and has no additional ones. The right cube of Figure A.5 


displays the Hamming distances between ju[.S;] and the strings in 


N2= {Ca.8s Cavs Cay} : 
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Table A.2 The 8 different sets of 3 strings in Nz and their Hamming distances 
with medians in M’. 


115i] \uy | 101010 | 101001 | 100110 ] 011010 | 100101 ] 011001 | 010110 | 010101 
101010 | {1,1,1} | {1,3,3} | {1,3, 3} | {1,3, 3} | 3,3, 5} | {3,3,5} | (3,3, 5} | (5,5, 5} 
101001 | {1,3,3} | {1,1,1} | {3,3,5} | {3,3,5} | {1,3,3} | {1,3,3} | {5,5,5} | (3,3, 5} 
100110 ‘| {1,3,3} | {3,3,5} | {1,1,1} | {1,3,3} | {1,3,3} | {5,5,5} | {1,3, 3} | (3,3, 5} 
011010 ~—‘| {1,3,3} | {3,3,5} | £3,3,5} | {1,1,1} | {5,5,5} | {1,3,3} | {1,3, 3} | (3,3, 5} 
100101 | {3,3,5} | {1,3,3} | {1,3,3} | {5,5,5} | {1,1,1} | {3,3,5} | {3,3,5} | 1,3, 3} 
011001 ~—‘| {33,5} | {1,3,3} | £5,5,5} | {1,3,3} | {3,3,5} | {1,1,1} | {3,3,5} | {1,3,3} 
010110 ~—| {33,5} | {5,5,5} | {1,3,3} | {3,3,5} | {3,3,5} | {3,3,5} | {1,1,1} | {1,3, 3} 
010101 ~—‘| {5,5,5} | {3,3,5} | {3,3,5} | {3,3,5} | {1,3,3} | {1,3,3} | {1,3,3} | {1,1,1} 


The left-most column lists the 8 possible strings y[.S;] for p € M’. In the column 
headers, the 6-bit string is the vertex which acts like uy in the corresponding 
rotation of the middle cube in Figure A.5. The entry in row 7, column £, is the 
multiset of Hamming distances between ju[S;] and each of the three strings in 
Nz corresponding to the cube rotation. For example, the column with heading 
101010 corresponds to the right cube in Figure A.5. 

This time, H,(j2)¥ H, (1) does not result in the same value for all ps € M’ so we do 
not have an identity. Instead, we require 8 different rotations of the middle cube in 
Figure A.5 to make an identity. The right-most cube is one rotation with the 101010 
vertex acting like uy. Figure A.2 lists the 8 rotations. Each column corresponds to a 
rotation with the column heading telling the label of the vertex from Figure A.1 which 
acts like ur. There is a row for each ju[S;], 1 € M’. The entries in the table indicate 
the multiset of Hamming distances with p|S;] in the corresponding cube rotation. 
For example, the column with heading 101010 corresponds to the right-most cube 
in Figure A.5. The column with heading 010101 corresponds to the middle cube in 
Figure A.5. If we use all 24 strings (3 strings from each of the 8 rotations), the union 
of the Hamming distance multisets in a single row of Figure A.2 is {1°,317, 5°} for 


any row. This multiset Z of 24 strings is an identity since it does not distinguish 


medians in M’. 


Set N3: Let 7 be a binary string in N3. Then for any « € M’, the Hamming distance 
A (u[S;], 7[S;]) will be in {0, 2, 4,6}. This comes from the fact that H(u[S;], 7[S;]) = 2¢ 


whenever [{j € {a, 8,7} : lei] # rle}} = 
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Figure A.6 The Hamming distances between 
a median on the cube and string 7 with 
Nas 


Let 7 be a string in N3 with t[x,| = T[xg] = T[v,] = 0 and no additional ones. 
All medians of height 1 have H(u[S;],7[5;]) = 2 and all medians of height 2 have 
A(u[S;], 7[S;]) = 4. This is displayed in Figure A.6. Let 


N3= {rt} Ng = {7}. 


Once again, we will need all 8 rotations of this cube to make an identity. This 


identity will consist of exactly 8 strings. 


We have now defined 6 sets of strings, Ni,No,.V3 and their complements. For 
each j € [3] and p € M’, define 


H,(u) := {A (u[Si], v[Si]) + ev) : v € NG}. 


We have already seen that for any pu, p’ € M’, both of height 1, H;(u) = H;(w’). 
Also, for any p, yp’ € M’, both of height 2, H; (4) = H;(w’). 

For j € [3] and non-negative integer e, let Nn be the set of strings in NV; with 
e additional ones added to each string. The set Nj *) is defined similarly. 

The goal is to find a multiset C! of N4,N2,N3 and their complements, possibly 


with additional ones added to each, such that 


e there is a one-to-one correspondence between the sets of the form Net and 


the sets of the form Noe and 
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e there is a method to take unions and set subtractions of the multisets 
{H(HlSi, 1S): ENG} w {CS o1Si]) Nj} 
for each NOY € C; so that for any medians p, py’ € M’ of height 1,2, or 3 on 
the median cube and jig € M’ with jo[S;] = 010101, then 
* Hy(p) = Hy (qu!) and 
4 Hija) # H(t). 
One possible multiset is C) = {Mit Noe No fakes NOP. ne | with 
(as G wi”) \ (nse? . Na*”) ‘ (nse ig Ns'*??) . 


The resulting Hamming distances are displayed in Figure A.7. 


{7,7,7} {2,22} {77,7} {22,23 {7} £2} (27 
{50 7} {22,44 {ORE {2,4, 4} {5} {4} 7 awe 
{5,5,7} ° {2,4,4} »{3,5,5} {4,46} {3} ~ {6} {2,7} 
{5,5,5} {4,44} {3,3,3} {66,6} {1} {8} {2,7} 


Uf 
Ce a4 Hat) HW?) " Hy )o (Hg Hy) 


Figure A.7 A selection which distinguishes the Hamming distances at uy from 
the rest. 


Ultimately, we need a union of strings that will accomplish the same final goal, 
without the use of set subtractions. However, we can easily modify the collection in 
Figure A.7 to remove the set subtraction by essentially adding the identities Tt 
and Eo The first one contains Nt) as a subset, so jie \ ir) is a collection 
of 21 strings. Likewise, pave contains Kee. 


Let 
ee (wg ig wi”) WJ (age? is Ns*””) 


in (a \ ng?) ei a \w)) . 
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This is a collection of 50 strings such that for any uw € M’, yw of height at least one 
in the median cube, 
{H(p[S,],v[S,]) + e(v) : v € Ci} = {3°, 5", 7°} U {2° 47, 6°} U {2, 7} 
= {9! 36 4i2 pi2 6° eo. 
On the other hand, if jp € M’ has p[.S;] = 010101, then 
{H(u[Si], v[Si]) + e(v) : v © Ci} = {3°, 57, 77} U {2°, 4, 6°} w {1, 4°, 5°, 8} \ {3°, 6°} 
So OAD ART ss ae Ora deo 
By adding n — 3 to each of these values, we obtain the Hamming distances H(p, v) 
which are listed in Table 2.1. 


Table 3.1 details a different set of 26 strings strings for clause c; in exactly the 


same way as they are explained for Table 2.1. The conclusions are also the same. 
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